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Preface 


It is now well known that Fermat’s last theorem has been proved. For 
more than three and a half centuries, mathematicians — from the 
great names to the clever amateurs — tried to prove Fermat’s famous 
statement. The approach was new and involved very sophisticated 
theories. Finally the long-sought proof was achieved. The arithmetic 
theory of elliptic curves, modular forms, Galois representations, and 
their deformations, developed by many mathematicians, were the 
tools required to complete the difficult proof. 

Linked with this great mathematical feat are the names of TANI- 
YAMA, SHIMURA, FREY, SERRE, RIBET, WILES, TAYLOR. 
Their contributions, as well as hints of the proof, are discussed in 
the Epilogue. This book has not been written with the purpose of 
presenting the proof of Fermat’s theorem. On the contrary, it is writ- 
ten for amateurs, teachers, and mathematicians curious about the 
unfolding of the subject. I employ exclusively elementary methods 
(except in the Epilogue). They have only led to partial solutions 
but their interest goes beyond Fermat’s problem. One cannot stop 
admiring the results obtained with these limited techniques. 

Nevertheless, I warn that as far as I can see — which in fact is 
not much — the methods presented here will not lead to a proof of 
Fermat’s last theorem for all exponents. 


vi Preface 


The presentation is self-contained and details are not spared, so 
the reading should be smooth. 

Most of the considerations involve ordinary rational numbers and 
only occasionally some algebraic (non-rational) numbers. For this 
reason I excluded Kummer’s important contributions, which are 
treated in detail in my book, Classical Theory of Algebraic Num- 
bers and described in my 13 Lectures on Fermat’s Last Theorem 
(new printing, containing an Epilogue about recent results). 

There are already — and there will be more — books, mono- 
graphs, and papers explaining the ideas and steps in the proof of 
Fermat’s theorem. The readers with an extended solid background 
will profit more from reading such writings. Others may prefer to 
stay with me. 

In summary, if you are an amateur or a young beginner, you may 
love what you will read here, as I made a serious effort to provide 
thorough and clear explanations. 

On the other hand, if you are a professional mathematician, you 
may then wonder why I have undertaken this task now that the 
problem has been solved. The tower of Babel did not reach the 
sky, but it was one of the marvels of ancient times. Here too, there 
are some admirable examples of ingenuity, even more remarkable 
considering that the arguments are strictly elementary. It would be 
an unforgivable error to let these gems sink into oblivion. As Jacobi 
said, all for "l'honneur de l'esprit humain." 


August, 1997 Paulo Ribenboim 


Reader 


You may feel tempted to write your own (simpler) proof of Fermat’s 
last theorem. 

I have strong views about such a project. It should be written in 
the Constitution of States and Nations, in the Chapter of Human 
Rights: 

It is an inalienable right of each individual to produce his or her 
own proof of Fermat’s last theorem. 

However, such a solemn statement about Fermat’s last theorem 
(henceforth referred to as THE theorem) should be tempered by the 
following articles: 


Art. 1. No attempted proof of THE theorem should 
ever duplicate a previous one. 


Art. 2. It is a criminal offense to submit false proofs 
of THE theorem to professors who arduously earn 
their living by teaching how not to conceive false proofs 
of THE theorem. 


Infringement of the latter, leads directly to Hell. Return to Par- 
adise only after the said criminal has understood and is able to re- 
produce Wiles’ proof. (Harsh punishment.) 
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The Problem 


In the margin of his copy of Bachet’s edition of the works of Dio- 
phantus, ! Format wrote: 

“Tt is impossible to separate a cube into two cubes, or a biquadrate 
into two biquadrates, or in general any power higher than the second 
into powers of like degree; I have discovered a truly marvelous proof, 
which this margin is too small to contain.” 

In modern language, this means: 


If n is any natural number greater than 2, the equation 


has no solutions in integers, all different from 0 (i.e., it has only the 
trivial solution, where one of the integers is equal to 0). 


The above statement has been called Fermat’s last theorem, or 
conjecture, or problem. 

We begin with the following remarks. 

In order to prove Fermat’s theorem for all exponents greater than 
2, it suffices to prove it for the exponent 4 and every odd prime 


1This copy is now lost, but the remarks appeared in the 1679 edition of 
the works of Fermat, edited in Toulouse by his son Samuel de Fermat. 


2 The Problem 


exponent p. Indeed, if n is composite, n > 2, it has a factor m which 
is 4 or an odd prime p. If the theorem fails for n — ml, where m — 4 
or p, l > 1, if x,y,z are non-zero integers such that r" + y" = z^ 
then (im + (y')™ = (z')™ and the theorem would fail for m. 

Occasionally, we shall also indicate some results and proofs for 
even exponents or prime-power exponents. 

The following general remarks are quite obvious and henceforth 
will be taken for granted. 

If n is odd then X” + Y” = Z” has a non-trivial solution if and 
only if X” + Y” + Z” = 0 has a non-trivial solution. 

If x,y,z are non-zero integers such that r" + y" = z”, if d = 
gcd(z,y,z) and pn = x/d,y = y/d,21 = z/d then x? + y? = zf, 
where the non-zero integers 21, yi, 21 are pairwise relatively prime. 
So, if we assume that Fermat’s equation has a non-trivial solution, 
then it has one with pairwise relatively prime integers. 

Moreover, if z, y, z are non-zero pairwise relatively prime integers 
such that x" + y" = z” then x + y,z — x,z — y are also pairwise 
relatively prime. Indeed, if a prime p divides x + y and z — x then 
x = z (mod p), hence à" = z” = x” + y" (mod p) therefore y^ = 0 
(mod p), so p divides y, and since p divides x + y then p divides z, 
which is contrary to the hypothesis. This shows that gcd(x + y, z — 
x) = 1. In a similar way, we may show that gcd(r + y,z — y) = 1 
and gcd(z — z,z — y) = 1. 

Following tradition, we say that the first case of Fermat’s theorem 
is true for the odd prime exponent p when: if x,y,z are (non-zero) 
integers, not multiples of p, then x? + y? 4 zf, 

The second case is true for the odd prime exponent p when: if 
mu. 5 are non-zero pairwise relatively prime integers, and p divides 
xyz then x? + y? Z z?. As said above, in this case p divides one and 
only one of the integers x,y, z. 

More generally, for an arbitrary integer n = 2"m, u > 0, m odd, 
we say that the first case of Fermat’s theorem is true for the exponent 
n when: if x,y,z are (non-zero) integers and gcd(m, xyz) = 1 then 
arty" £ 27. 

Similarly, the second case is true for the exponent n when: if x, y, z 
are (non-zero) pairwise relatively prime integers and gcd(m, xyz) 4 1 
then x" + y" #2”. 


I 
Special Cases 


This chapter is devoted to the proof of special cases of Fermat’s 
theorem: exponents 4, 3, 5, and 7. However, we begin by considering 
the exceptional case of exponent 2. 


I.1. The Pythagorean Equation 


We study briefly the Pythagorean equation 
(1.1) X +Y’ =Z. 


A triple (x,y,z) of positive integers such that z? + y? = 2? is called 
a Pythagorean triple, for example, (3, 4,5) since 3? + 4? = 5°. 

If x,y,z are nonzero integers such that x? + y? = z? then |z], 
(ul, |z| also satisfy the same equation. Note that x, y cannot be both 
odd, otherwise 2? = 1+1 (mod 4), which is impossible. Moreover, if 
d = gcd(z, y, z) then x/d, y/d, z/d also satisfy the equation. Thus, it 
suffices to determine the primitive solutions (x, y, z) of (1.1), namely 
those such that x > 0, y > 0, z > 0, x is even, and gcd(z, y, z) = 1, 
hence y and z are odd. 

It is stated in Dickson's (1920) History of the Theory of Numbers, 
Vol. IL, pp. 165-166, that Pythagoras and Plato gave methods to 
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find solutions of equation (1.1). In Lemma 1 to Proposition 29 of 
Book X of The Elements, Euclid gave a geometric method to find 
solutions of (1.1). 

Diophantus indicated how to find all solutions, as in the following 
result. 

Leonardo di Pisa (Fibonacci) also gave in 1225 an interesting 
method to find solutions. 


(LA) Jfa,b are integers such that a > b > 0, gcd(a, b) = 1, a,b of 
different parity, then the triple (x,y,z), given by 


x = 2ab, 
ya -— b, 
z = a? + b?, 


is a primitive solution of (1.1). This establishes a one-to-one corre- 
spondence between the set of pairs (a,b) satisfying the above condi- 
tions, and the set of primitive solutions of (1.1). 


PROOF. If a,b are integers satisfying the conditions of the statement, 
let x,y, z be defined as indicated. Then 

x? +y’ = 4a2b? + (a? = py = (a? +) =: 22. 
Clearly x > 0, y > 0, z > 0, x is even, and gcd(z, y, z) = 1 because 
if d divides x, y, and z then d divides 2a? and 2b°, so d = 1 or d = 2 
(since gcd(a, Bie = 1); but d Æ 2 because y is odd (a,b do not have 
the same parity). 

Different pairs (a,b) give different triples (x, y, z). 

Conversely, let (x, y, z) be a primitive solution of (1.1), so z?4-y? = 
z?. From gcd(x, y, z) = 1 we have ged(z, z) = 1. Since x is even then 
z is odd hence ged(z — z,z + x) = 1. Since y? = (z — x)(z + x), it 
follows from their decomposition into prime numbers that z—2, z+x 
are squares of integers, say z + x = t, z — x = u?, and t,u must be 
positive odd integers, with t > u > 0. Let a,b be integers such that 
2a = t+ u, 2b = t — u, hence t = a +b, u = a — b with a > b > 0. So 


z = ((a +b)? — (a — b)?)/2 = 2ab, 
y? = ut = (a — b) (a +b)? = (a? — 0’)? so y—-a?—VW, 
z =((a+b)? + (a — 0)?)/2 = a? + b. 
We note that gcd(a, b) = 1 because gcd(z — z, z + x) = 1 and finally 
a+ b= t is odd so a,b are not both odd. 
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For example, the smallest primitive solutions for (1.1), ordered 
according to increasing values of z, are the following: 


(4, 3,5), (12,5,13), (8,15,17), (24,7, 25), 
(20,21,29), (12,35,37), (40,9,41), (28, 45,53), 
(60,11,61), (56,33,65), (16,63,65), (48,55, 73). 


In view of (1A), to find the primitive solutions of (1.1) amounts 
to determining which odd positive integers are sums of two squares, 
and in each case, to write all such representations. Fermat proved: 
n > is a sum of two squares of integers if and only if every prime 
factor p of n, such that p = 3 (mod 4), appears to an even power 
in the decomposition of n into prime factors (see the proof below). 
For every integer n which is the sum of two squares of integers, let 
r(n) be the number of ordered pairs (a,b) such that a? +b? = n, a,b 
integers not necessarily positive. For example, r(1) — 4, r(5) — 8. It 
was proved by Jacobi, and independently by Gauss, that 


r(n) = 4(difn) — d3(n)), 


where d; (n) (respectively, d3(n)) is the number of divisors of n which 
are congruent to 1 modulo 4 (respectively, congruent to 3 modulo 4) 
(see Hardy and Wright (1938, p. 241)). 

With this information, it is possible to determine explicitly the 
primitive Pythagorean triples (x, y, z). Now we paraphrase Fermat's 
proof which is of historical importance. We begin with a very easy 
identity: 


(1.2) (a? + b’)(c? +d’) = (ac + bd)? + (ad — bc)? 
= (ac — bd}? + (ad + bc)?. 


Now we show 


(1B) A prime number p is a sum of two squares if and only if 
p=2 or p= 1 (mod 4). 


PROOF. If p Z 2 and p = a? + b?, then a,b cannot both be even 
— otherwise 4 divides p. If a,b are both odd, then p = 1 +1 = 2 
(mod 4), since every odd square is congruent to 1 modulo 4. Thus 
p = 2. If, say, a is odd and b is even, then p = 1 +0 — 1 (mod 4). 
Conversely, 2 = 1? + 1?, so let p = 1 (mod 4). From the the- 
ory of quadratic residues, —1 is a square modulo p, so there exists 
x,1 <x <p—1, such that z? + 1 = 0 (mod p), thus z? +1 = 
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mp, with 1 < m € p—1. Hence the set {m | 1 < m € p— 
1, such that mp = x? + y? for some integers x, y} is not empty. Let 
mo be the smallest integer in this set, so 1 < mp < p — 1. We show 
that mo = 1, hence p is a sum of two squares. Assume, on the 
contrary, that 1 < mp. We write 


{ £ = CMo + Tı, 
y = dmo +y, 


with —mo/2 < z1, y1 € Mo/2, and integers c,d. We observe that x; 
or yı is not 0. Otherwise mê divides x? +y? = mop, hence mo divides 
p, thus mo = p, which is absurd. We have 0 < x? + y? € m2/A 4 
m2/4 = m2/2 < m2 and x? + y? = x? + y? = 0 (mod mo). Hence, 
xi +y? = mom’ with 1 < m < mo. But mop = z? + y^, mom! = 
ti yi, hence mgm'p = (a?--y?)(zi- yi) = (xai yi)? (2 —9mi). 
We also have 


mn + YY = «(x — cmo) + y(y — dmo) 
= (x? + y?) — mo(c + yd) 
= Mot, 

ry, — up = z(y — dmo) — y(x — cmo) 
= —mo(zd — yc) 


= Mou 


for some integers t, u. Hence m'p = t? +u?, with 1 < ml < mo. This 
is a contradiction and concludes the proof. 


(1C) A natural number n is the sum of two squares of integers if 
and only if every prime factor p of n, such that p = 3 (mod 4), ap- 
pears to an even power in the decomposition of n into prime factors. 
PROOF. Let n = př- p and assume that k; is even if p; = 3 
(mod 4). Then n = nin, where no > 1, nı > 1, and m, is the 
product of distinct primes which are either equal to 2, or congruent 
to 1 modulo 4. By (1B), each factor of n; is a sum of two squares; 
by the identity indicated in (1.2), n; and therefore also n, is a sum 
of two squares. Conversely, let n = x? + y?; the statement is trivial 
if æ = Oory = 0. Let x,y be nonzero, let d = gcd(z,y), so d? 
divides n. Let n = d?n', x = dz’, y = dy’, hence gcd(z', y) = 1 and 
n = x? -- y". If p divides n’, then p does not divide x’ — otherwise 
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p would also divide y’. Let k be such that kx’ = y (mod p). Then 
a”? + y^? = UL k?) = 0 (mod p). Thus p divides 1 + k?, that 
is, —1 is a square modulo p, so p = 2 or p = 1 (mod 4), by the 
theory of quadratic residues. It follows that if p; = 3 (mod 4) then 
p; does not divide n’, hence p; divides d, so the exponent k; must 
be even. 


It is customary to say that a right triangle is a Pythagorean tri- 
angle when its sides are measured by integers a,b,c. If c is the 
hypothenuse, then c? = a? + b?. See also Mariani (1962). 

On this matter, we recommend Shanks’ book (1962) which con- 
tains an interesting chapter on Pythagoreanism and its applications, 
as well as the book by Sierpiński (1962). 

In 1908, Bottari gave another parametrization for the solutions of 
(1.1). The following simpler proof is due to Cattaneo (1908): 


(1D) ` (ob are odd natural numbers such that gcd(a, 0) = 1, if 
s > 1 then the triple (x,y,z) given by 

x = 2?5-1g? + 25 ab, 

y = b? + 25 ab, 

z = 2257102 + b? + 2% ab, 
is a primitive solution of (1.1). This establishes a one-to-one cor- 


respondence between the set of triples (a,b, sl satisfying the above 
conditions and the set of primitive solutions of (1.1). 


PROOF. It is clear that if x,y,z are defined as indicated then the 


triple (z, y, z) is a primitive solution of (1.1). 
Different triples (a,b, s) give rise to different primitive solutions 


(x,y,z), because 
b = z-2, 
2925-192 = Z—Y, 


2?ab = x +y— 2. 


Finally, if (x,y,z) is a primitive solution, 0 < x < z,0 < y < 
z, and z < x 4 y, because 2? = x? + y? < (x + y)?. We write 


T—=Z—U, 
y—2-—U, 
z—rcty-—u, 
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with u,v,w > 0. Then 


T—=UV+W, 
y=u+w, 
z=u+v+w. 


From x? + y? = z? it follows that w? = 2uv, hence w is even. Since 
gcd(u, v, w) = 1 and x is even and y is odd then v is even and u is 
odd. Let w = 2*w', v = 2'v', where v', w are odd, s > 1,1 > 1. 
Then 2?5u/? = 2u.2*v' sot = 2s—1 and w^? = wv’ with gcd(u, v’) = 1. 
Hence necessarily u,v’ are squares: u = b?, v' = a”, and therefore 
x = 275-19? + 2*ab, y = b? + 2?ab, z = 22 la? + b? + 2*ab. 


It is also interesting to describe the solutions of 


(1.3) X?’ +Y?’ =1. 


The solutions in integers are just (+1,0), (0, +1). 

We shall consider the solutions in rational numbers as well as, for 
each prime p, the solutions in the field with p elements. 

Let Q denote the field of rational numbers. For each prime p, let 
F, be the set {0,1,... , p — 1} of residue classes of Z modulo p. So, 
if a,b € Z, we have à = b if and only if a,b have the same remainder 
when divided by p. The operations of addition and multiplication 
in F, are defined as follows: T + Y = x +y, TY = Ty. With these 
operations, which satisfy the usual properties, F, becomes a field: if 
a € F, and à Æ 0, we have gcd(a, p) = 1, so there exist r,s € Z such 
that ar + ps = 1; then ar = 1. So r is the inverse of à in F,. For 
simplicity, we may use the notation x instead of x for the elements 
of F,. We shall indicate a result that is valid for Q as well as for 
each field F, for p > 2. Thus, let F = Q or F, (for p > 2). (More 
generally, F may be taken to be any field of characteristic different 
from 2, that is, 1+ 1 Z 0 in the field F.) 

Let oo bea symbol, oo ¢ F, and let T = (oo]U(t € F | 12-0? 4 0). 
Let S = Sp = ((z,y) € F x F | zx? +y? = 1). So the elements of S 
are the solutions of (1.3) in the field F. 

Let y : T —^ F x F be the following mapping: 


| (oo) = (0, —1), 


2 1-# 


(1.4) ift € F then y(t) = ( ) | 


UON] 2 
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We note that since 1+t? Æ 0, then 1+? is invertible, so the mapping 
y is well defined. 


(1E) With the above notations, y is a one-to-one mapping from 


] onto S. 
( 2 ) ( SH 
1 2 1 2 2 


then y(t) € S for each t € F such that 1 +t? Z Also, (0, —1) € S, 
so p(T) C S. 

Ift € F, 14-0? Z 0 then (1 — £2)/(14- t?) Z —1, because 1 +1 Z 0. 
Also, if t,t2 € F, 13-02 z: 0, 1-02 Z 0 then v(t) Z (t2), as is 
easily seen. Thus the mapping ¢ is one-to-one (because 1 4- 1 Z 0). 

Now we show that (T) = S. Clearly, (0, —1) = (oo). Let 
(x,y) € S, (x,y) Z (0, —1). If x = 0 then y = 1 and (0,1) = (0). If 
x Æ 0let t = (1— y)/z, so 


PROOF. Since 


2(1 — 2 — 1 
E SH L-Pe CR K 
x x 
and 
an bapo 
Igp ot pan 


hence (z, y) = y(t), concluding the proof. 


If F = Q then 14- t? £0 for allt € Q, so T = QU {oo}. If F = F, 
with p > 2 then 1+ t? = 0 if and only if —1 is a square modulo 
p. According to the result of Fermat already quoted, —1 is a square 
modulo p > 2 if and only if p=1 (mod 4). 

Let N, denote the number of elements of Sg,. We have 


(1F) N,=2 and if p> 2 then 


N.= p—1 when p=1 (mod 4), 
P | p+1 when p=—1 (mod 4). 


PROOF. Sp, = {(0,1),(1,0)}, so M = 2. Let p > 2. Ifp=1 
(mod 4), then there exist two elements #1, t such that t?+1 = t2--1 = 
0. So Z(T) = (p — 2) + 1 = p — 1, hence by (IE), #(S) = p — 1. 
Similarly, if p = —1 (mod 4), then #(T) = p+ 1 and #(S) = 
HERR 
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1621 


1676 


1863 


1908 


1908 


1915 


1920 


1938 


1962 


1962 


1962 


1972 
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1.2. The Biquadratic Equation 


Now we take up the case n — 4. Fermat considered the problem 
of whether the area of a Pythagorean triangle may be the square of 
an integer (observation to Question 20 of Diophantus, Book VI of 
Arithmetica). 

He was led to study the equation 


(2.1) X*-y*-Z2 
and he showed (date unknown): 


(2A) Equation (2.1) has mo solutions in integers all different 
from 0. 


PROOF. If the statement is false, let (x,y,z) be a triple of positive 
integers with smallest possible x, such that z^ — y^ = z?. Then 
gcd(z, y) = 1, because if a prime p divides both x,y then p* divides 
22, so p? divides z; letting x = px’, y = py’, z = p?z' then a^ — y^ = 
z’, with 0 < 2’ < x, which is contrary to the hypothesis. 

We have 2? = gt — y* = (a? +y?) (a? — y?) and ged(z? + y?, x? — y?) 
is equal to 1 or 2, as is easily seen, because gcd(r,y) = 1. We 
distinguish two cases. 


Case 1: ged(z? + y?, 3? — y?) — 1. 

Since the product of z?--y?, z?—9? is a square then z?4-3?, z?2— 4? 
are squares; more precisely, there exist positive integers s, t, gcd(s, t) 
— ] such that 


x? + y? = 82, 
rL -y =t. 


It follows that s,t must be odd (since 2z? = s? + t? then s,t have 
the same parity and they cannot both be even). 
So there exist positive integers u,v such that 


pa 
v= (s— 1/2, 


and necessarily gcd(u, v) = 1, because s,t are odd. 


12 I. Special Cases 


We have uv = (s?—t?)/4 = y?/2 hence y? = 2uv. Since ged(u, v) = 
1 then there exist positive integers l,m such that 


u = 2l, a u= P, 

v= m?, v = 2m’. 
We just consider the first alternative, the other one being analogous. 
So u is even, gcd(u, v, x) = 1, and 


t? —1Y 2 t? 
gag (6i Go dE 


4 2 


It follows from (1A) that there exist positive integers a,b, 0 < b « a, 
gcd(a, b) = 1, such that 


27 = u = 2ab, 
m? ewe =F, 


x = a? + b?, 
hence |? = ab. Thus there exist positive integers c,d, gcd(c, d) = 1, 


such that 
a= @, 
b = æ, 


and so m? = c* — dl. We note that 0 < c < a < x and the triple of 
positive integers (c, d, m) would be a solution of the equation, which 
is contrary to the choice of x as smallest possible. 
Case 2: gcd(a? + y?, x? — y?) — 2. 

Now z,y are odd and z is even. By (1A) there exist positive 
integers a, b, 0 < b < a, gcd(a, b) = 1, such that 


a = a2 +b, 
y? a? — b?, 


z — 2ab. 


Hence z?y? = oi — b* with 0 < a < x and this is contrary to the 
choice of x as smallest possible. 


'The above argument is called the method of infinite descent and 
was invented by Fermat. It may also be phrased as follows: if 
(zo, Yo, zo) were a solution in positive integers of (2.1) then we would 
obtain a new solution in positive integers (z1,91,21) with zi < zo. 
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Repeating this procedure, we would produce an infinite decreasing 
sequence of positive integers 


Z > ën >see 


which is not possible. 

As a corollary, we obtain the original statement of Fermat, pro- 
posed as a problem or mentioned in letters to Mersenne [for Sainte- 
Croix] (September 1636), to Mersenne (May ?, 1640), to Saint- 
Martin (May 31, 1643), to Mersenne (August 1643), to Pascal (25 
September 1654), to Digby [for Wallis] (April 7, 1658), to Carcavi 
(August 1659): 


(2B) The area of a Pythagorean triangle is not the square of an 
integer. 


PROOF. Let a,b,c be the sides of the Pythagorean triangle, where c 
is the hypotenuse. So c? = a? + b?. 
Assume that the area is the square of an integer s: ab/2 = s?. 
Then 
(a+b)? = c? +4s?, 
{ (a — b)? = c? —4s?. 
Hence (a? — b?)? = ct — (2s)*, so the equation X* — Y* = Z? would 
have nontrivial solution in integers, contradicting (2A). 


We also state explicitly (this is proposed as a problem or men- 
tioned in letters to Mersenne [for Sainte-Croix] (September, 1636), 
to Mersenne (1638), to Mersenne (May ?, 1640)): 


(2C) The equation 
(2.2) Ke Kata 
has no solution in integers, all different from 0. 


PROOF. If x,y,z are nonzero integers such that zi + y* = z* then 
z* — y^ = (x), which contradicts (2A). 


The above results were also reproduced by Euler (1770) and Leg- 
endre (1808, 1830). 

A companion result to (2A) is the following (see the explicit proof 
by Euler, 1770): 
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(2D) The equation 
(2.3) XHY 
has no solution in integers all different from 0. 


PROOF. If the statement is false, let (x,y,z) be a triple of positive 
integers, with smallest possible z, such that z*--y* = z?. As in (2A), 
we may assume gcd(z, y) = 1. We also note that x, y cannot be both 
odd, otherwise z? = zf + y? = 2 (mod 4) and this is impossible. So 
we may, for example, assume x to be even. 

From (z?)? + (y?)? = z? it follows that (z?,y?, z) is a primitive 
solution of (1.1). By (1A), there exist integers a,b, such that a > 
b > 0, gcd(a, b) = 1, a,b are not both odd and 


x? = 2ab, 
y? = a? — b, 
z —a +b. 
Moreover, b must be even. For if b is odd, then a is even, y? = 
a? — b? = —1 (mod 4), which is impossible. 
Now we consider the relation 0? 4-y? = a?, where y, b, a are positive 


integers, b is even, and gcd(b, y, a) = 1. By (1A), there exist integers 
c, d such that c > d > 0, gcd(c, d) = 1, c,d of different parity and 


b = 2cd, 
y Sead, 
a = e+e. 


Therefore x? = 2ab = 4cd(c? + d’). But c,d, c? + d? are pair- 
wise relatively prime. By the decomposition of x? into primes, we 
conclude that c,d, c? + d? are squares of positive integers, say: 


Hence 
(2.4) pt+ge=r, 


that is, the triple (p,q,r) is a solution of (2.3). But z = a? + b? = 
(c? +d’)? Ad? > r > r (sincer > 1). This contradicts the choice 
of z as minimal possible, and concludes the proof. 
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TABLE 1. FLT for the exponent 4. 


Author Year 

Frénicle De Bessy 1676 

Euler 1738 (publ. 1747), 1771 
Kausler 1795/6 (publ. 1802) 
Barlow 1811 

Legendre 1823, 1830 
Schopis 1825 

Terquem 1846 

Bertrand 1851 

Lebesgue 1853, 1859, 1862 
Pepin 1883 
'Tafelmacher 1893 

Bendz 1901 

Gambioli 1901 

Kronecker 1901 

Bang 1905 

Bottari 1908 

Rychlik 1910 

Nutzhorn 1912 
Carmichael 1913 

Vranceanu 1966 


Other proofs of Fermat’s theorem for the exponent 4 are given by 
the authors listed in Table 1. Now we indicate a statement which is 
equivalent to Fermat’s theorem for the exponent 4 (see Vränceanu, 
1979): 


(2E) The following statements are equivalent: 


(1) Fermat’s last theorem is true for the exponent 4. 
(2) For every integer m # 0 the only solutions in nonzero inte- 
gers of 2X* = mY (m? + Y?) are (m,m) and (—m,m). 


PROOF. (1) — (2) Let m #0 and assume that there exist nonzero 
integers u, t such that 2u* = mt(m?--1?). Let x = 2u, y = t-m, z= 
t+m. Then z* — y* = (z-y)(z+y)(2? + y?) = 2m- 2t(2t? + 2m?) = 
8mt(t? + m?) = 16u* = x“. 

By hypothesis, xyz = 0. If x = 0 then y = +z hence m = 0, 
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contrary to the hypothesis. If y = 0 then t = m, x = +z, so u = +m. 
If z = 0 then x = 0, again contrary to the hypothesis. 

(2) — (1) Ifat+y* = z^ then 224 = 2(z* — yt) = 2(z — y)(z + 
y)(z? y?) = (z—y)(z+y) [(2 — y)? + (s + yy]. So taking m = 2—y 
then t = z +y, u = x satisfy the relation 2u* = mt(m? 4-12). If m or 
u or t is equal to 0 then x = 0. If m,t,u Æ 0, by hypothesis t = m 
hence y = 0, an absurdity. 


We conclude this section by illustrating how the method may be 
applied to find solutions of some similar diophantine equations. 


(2F) The equation 
X* - AY! = +7? 


has no solution in nonzero integers. 


PROOF. It suffices to consider the equation X^ — AY^ = Z?. Because 
if x,y,z are nonzero integers such that z^ — 4y* = —z? then 4z* — 
(2y)* = —(2z)?, so (2y)* — Ax* = (2z)? and (2y,2,2z) would be a 
solution of the first equation. 

Now, if x,y,z are positive integers such that x^ — 4y* = z? and 
gcd(z, y, z) — 1 (as we may assume without loss of generality), there 
exist integers a, b with a > b > 0 and 


2y? — 2ab, 
SM — b, 
r? =a? +b. 


Since gcd(a, b) = 1 then a,b are squares, say a = c?, b = d?. Hence 
x? = cl +d", and this relation is impossible by (2D). 


Legendre proved: 


(2G)  lfz,y,z are nonzero integers and 
at + yt = 222, 

then x? = y? and 2? = x“. 

PROOF. We have 


434 Z (ri Ty) E (xt = y^ + Ax ^y, 
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SO 


4 AN 2 
A Au IEN 
Aay (=) 


(in particular, z^ — y* is even). Since z, y, z are not zero, by (2A) we 
have zi = y^, so x? = y*. Then 2? = x. 


(2H) If x,y,z are nonzero integers and ji + 2y* = z?, then 
pte EATA, 


PROOF. Multiplying by 8 we have (2x)* + (2y)* = 2(2z)?. By (2G), 
(2x)? = (2y)? and (22)? = (9m) 5 so x? = y”, 2? = 444. 


The next result is due to Lucas (1877). We present here an easier 
proof due to Obláth (1952): 


(21) The equation 
4x*—1=3Y* 


has only the trivial solutions (+1, +1) in integers. 


PROOF. If z, y are integers such that 3y* = 4r*—1 = (22? 4-1)(222 — 
1), since 27? — 1 #0 (mod 3) then there exist integers a,b such that 
2x? + 1 = Ant, 23? — 1 = bt. By (2G), the last equation is only 
satisfied when x = +1, b= +1, hence y = +1. 
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1.3. Gaussian Numbers 


We shall prove that X* + Y^ = Z? has no solution in nonzero inte- 
gers of the Gaussian field. This result is explicitly proved in Hilbert’s 
Zahlbericht (1897, Theorem 169); see also Sommer (1907) and Han- 
cock (1931). 

The set of complex numbers a = a + bi, where i = y—1 and 
a,b € Q, constitutes the Gaussian field K = Q(i) The numbers 
a = a + bi, where a,b € Z, are called the Gaussian integers. They 
constitute a ring, denoted by A = Zi]. 

If o, 8 € K, B divides a if there exists a Gaussian integer y such 
that a = By. We write 8 | a when 8 divides a. Two nonzero 
Gaussian integers a, D are associated when a divides 8 and ( divides 
a; we write a ~ D. The Gaussian integers associated with 1 are 
called the (Gaussian) units. It is easily seen that they are +1, +i. 

A nonzero Gaussian integer a is a prime if it is not a unit and 
the only Gaussian integers dividing a are units or associated with 
a. In the field of Gaussian numbers, every nonzero Gaussian integer 
o is the product of prime Gaussian integers: @ = ^47y5::-7y,. This 
decomposition is unique in the following sense: if we also have a = 
0102 --- +, where each 6; is a prime Gaussian integer, then s = t, and 
changing the order if necessary, y; and 6; are associated (for every 
peus) 

Therefore we may define, in an obvious way, the greatest common 
divisor of nonzero Gaussian integers, which is unique up to units. 

If o, 8,y are Gaussian numbers and y Æ 0, we write a = f 
(mod y) when y divides a — 5. The congruence relation = satisfies 
the same properties as the congruence for ordinary integers. The 
Gaussian integer À = 1 — i is a prime and 2 = iX, so M | 2 but 
A? y2. We have 1 +i = i(1— i) = iA. 

There are precisely four distinct congruence classes modulo 2, 
namely the classes of 0, 1, 7, and A. Indeed, these numbers are pair- 
wise incongruent modulo 2. On the other hand, according to the 
parity of a, b, we deduce that a+bi is congruent to 0, 1, i, or A, mod- 
ulo 2. In particular, if À Ya = a + bi then a = 1 (mod 2) ora =i 
(mod 2). Then, o? = +1 (mod 4) and o* = 1 (mod 8), that is, 
a^ =1 (mod A9) since 8 = —iA6. 

Now we show: 
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(3A) The equation 
X44Y'=7 


has no solution in Gaussian integers all different from zero. 


PROOF. Let £,7,0 € Z[i] be nonzero and such that £* jn = 0?. We 
may assume without loss of generality that gcd(£, n) = 1. Indeed, if 
6 = ged(£, ol, then € = 6€', n = At, with €',7' € Zli], ged(£', m) = 
1; so ôt divides 0?, hence 6? divides 0, we may write 0 = 6?0', with 
0' € Zi]. Hence ET + 7° = 0" where gcd(£', 7’) = 1. 

From gcd(£,9) = 1 it follows that £,5,0 are pairwise relatively 
prime. We consider two cases. 


Case 1: A does not divide £r. 

By a preceding remark £^ = 1 (mod A9), a = 1 (mod A5) so 
6? = £^ + nf = 2 (mod Aê). Since 2 = id? then A? | 6?, hence 
A|0. However, M Y 0, because A* /2. We write 6 = A0,, where 
Aë. Thus 707 = 2 = id? (mod A5) hence 6? = i (mod A*), and 
therefore 01 = —1 (mod A°) since A* ~ 4, A8 ~ 8. However, A y 6, 
hence 01 = 1 (mod A5), which would imply by subtraction that 2 = 0 
(mod A9), and this is absurd. 


Case 2: divides £. 

Hence A J/0. We write £ = AE’, with m > 1, £' € Z[i], and À yE. 
The essential part of the proof consists in showing the following 
assertion: 

Let n > 1 and let € be a unit of Zi] (so € = +1 or +i). If there 
exist a, 5,7 € Zi], pairwise relatively prime, not multiples of À, and 
eM" a* + 8* = 7? then: 

(a) n > 2; and 
(b) there exists a unit €; and a1, 01,71 € Zli], pairwise relatively 
prime, not multiples of A, such that 


EX Dot + gi = 42. 


The hypothesis is satisfied with n = m,ce=1,a=€',6=7n, 7 
0. By repeated application of the above assertion, we would find a 
unit €’ and a’, 3’, € Z[i], pairwise relatively prime, not multiples 
of A, such that 
e Mo/* 4 p" = y’. 


This contradicts (a) above. 
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First we show that n > 2. Indeed €A*"a* + G4 — 1 = 4? — 1 and 
since A / B, then ^ = 1 (mod A°), so +? = 1 (mod àt). But A fv 
hence y = i (mod A?) or y = 1 (mod 2?). In the first case, 5? = —1 
(mod A?) hence AT would divide 2, a contradiction. So + — 1 = Mu 
where u € Z[i] and hence y + 1 = Mu +2 = X(u + i). But either 
A | wor if A) u then u = à (mod À) because 1 = à (mod À); so 
u = —i (mod A). We have shown that in any case A | (fs +i) so A? 
divides y? — 1 = Atu(u +i), hence A? divides £A?" + (8* — 1); but 
A6 divides 34 — 1, À a, hence An | A?" so n > 2. 

Now we prove (b). We have £A*"o^ = 4? — 8* = (y — B?)(74+ 8?). 
We note that gcd(y — 8?, y + 8?) = AT. Indeed À must divide one 
of the factors in the right-hand side, hence it divides both factors, 
because (A + 82) — (A — 82) = 26? is a multiple of M. Since + 
divides the right-hand side, this implies necessarily that A? divides 


both factors: 
YT B= Xv, 
yc B? = ayn ep! 
where v, v/ € Z[i] and gcd(v,v/) = 1. Hence cA*"a*t = Avv so by 
the uniqueness of factorization up to units, v,v’ must be fourth pow- 
ers, up to units v = wkt, v/ = wk, where &, wl € Zli], gcd(x, K’) = 
1, w,w' are units. Thus 
y- B o lie, 
y+ g - w Mn! 


and subtracting, 
26? E wu! Mn _ wA. 
Hence 
B? = —iw Mn ch iwk, 
so 
QUAM DE E wkt — a 
with units w = —iw’, wı = iw. We show that w, = 1, which suffices 
to establish statement (b). Since n > 2 then Ai | 8? — w,K*; but 
A} B, hence À Fr hence &* = 1 (mod A9) so À* | 8? — w. But A8 
divides 6+ — 1 = (6? — 1)(8? + 1) hence 8? = 1 or — 1 (mod Ai), 
This shows that w,; = +1. If w = —1, by multiplication with —1, 
we obtain the relation 


E MG -1),74 + =a (i8)?. 
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So, in all cases, we have shown (b), proving the statement. 
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1.4. The Cubic Equation 


Fermat proposed the problem to show that a cube cannot be equal 
to the sum of two nonzero cubes. See letters to Mersenne [for Sainte- 
Croix] (September, 1636), to Mersenne (May ?, 1640), to Digby [for 
Wallis} (April 7, 1658), to Carcavi (August, 1659), all mentioned 
in the Bibliography of Section L2; see also a letter to Digby [for 
Brouncker] (August 15, 1657). 

Euler discovered a proof of this statement. It used the method 
of infinite descent and appeared in his book on Algebra, published 
in St. Petersburg in 1770, translated into German in 1802, and 
into English in 1822. A critical study of Euler’s proof uncovered 
an important missing step, concerning the divisibility properties of 
integers of the form a? + 3b?. We note that in his paper of 1760, 
Euler had already proved rigorously that if an odd prime number 
p divides a? + 3b? (where a,b are nonzero relatively prime integers) 
then there exist integers u,v such that p = u? + 3v?. Yet, Euler did 
not establish in full the Lemma 4.7 which is required in the proof. 
Legendre reproduced Euler's proof in his book (1808, 1830) without 
completing the details. 

In 1875, Pepin published a long paper on numbers of the form a+ 
b4/—c pointing out arguments which had been insufficiently justified 
by Euler concerning numbers of the form a? + cb’, especially for c = 
1,2,3,4, 7. Schumacher (1894) noted explicitly the missing link in 
the proof. In 1901 Landau offered a rigorous proof; this was again the 
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object of Holden’s paper (1906) and, once more in 1915, a detailed 
proof appeared in Carmichael’s book.! In 1966, Bergmann published 
a paper with historical considerations and a thorough analysis of 
Euler’s proof. Once more, in 1972, R. Legendre pointed out that 
Euler’s proof was not perfect. In his book, Edwards (1977) discusses 
also this proof. 


(4A) The equation 
(4.1) X?-Y?-2?20 
has only the trivial solutions in integers. 


PRoor. Assume that x, y, and z are nonzero, pairwise relatively 
prime integers such that z? + y? + z? = 0. Then they must be 
distinct (because 2 is not a cube) and exactly one of these integers 
is even, say x,y are odd and z is even. Among all the solutions with 
above properties, we choose one for which |z| is the smallest possible. 
We shall produce nonzero pairwise relatively prime integers l, m, n 
which are such that |? + m? + n° = 0, n is even, and |z| > |n|. This 
will be a contradiction. Since x + y, x H are even, there exist 
integers a,b such that 2a = x+y, 2b = x — y; so x =a +b, y=a—b 
and therefore a,b Z 0, gcd(a, b) = 1 and a,b have different parity. 
Then —2? = x? + y? = (a + b)? + (a — b)? = 2a(a? + 3b?). But 
a? + 3b? is odd and z is even, hence 8 divides z?, so 8 divides 2a, 
so b is odd. We have gcd(2a, a? + 3b”) equal to 1 or 3. In fact, if 
p'(k > 1) is a prime power dividing 2a and a? + 3b? then p Æ 2 so 
p" divides a, hence 3b?; but p does not divide b, so k = 1 and p = 3. 
Now we consider two cases. 


Case 1: gcd(2a, a? + 30?) = 1. 
Then 3 does not divide a. From —z? = 2a(a? +307) it follows from 
the unique factorization of integers into primes that 2a and a? + 3b? 


are cubes: 
2a = r°, 
a? + 3b? = 83, 


where s is odd and not a multiple of 3. At this point we make use 
of a fact to be justified later: if s is odd and si = a? + 3b? with 


1A proposed simplification of Euler’s proof by Pizá (1955) is wrong, as 
pointed out by Yf (1956). 
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gcd(a,b) = 1, then s also must be of the form s = u? + 3v?, with 
u,v € Z, and 
a = u(u? — 9v?), 

{ b = 3u(u? — v?). 
Then v is odd, u is even (because b is odd), u Z 0, 3 does not 
divide u (since 3 does not divide a) and gcd(u,v) — 1. Therefore, 
2u, u + 3v, u — 3v are pairwise relatively prime and from r? = 2a = 
2u(u — 3v)(u 4 3v) it follows that 2u, u — 3v, u 4- 3v are cubes: 


2u = —n3, 
u— 3v = l’, 
u + 3v = më, 


with l, m, n distinct from 0 (since 3 does not divide u) and pairwise 
relatively prime. We conclude that 


[3 -- m? +n’? — 0, 
where n is even. Now we show that |z| > |n|. In fact, 
|z? = |2a(a? + 3b?)| = |n? (u? — 9v?) (a? + 3b7)| > 3|n?| > |n? 


because u? — 9v? = I’m? # 0 and b Æ 0, since it is odd. This 
contradicts the minimality of |z|. 
Case 2: gcd(2a, a? + 3b?) = 3. 

We write a = 3c. Thus, c is even and indeed 4 divides c, while 3 
does not divide b (since a, b are relatively prime). So —z* = 6c(9c? + 
30?) = 18c(3c? + b?) where ged(18c, 3c? + 0?) = 1. Indeed, c is even 
and b is odd, therefore 3c? + b? is odd, 3 does not divide 3c? + b? and 
gcd(b,c) = 1. By the unique factorization of integers into primes, 
18c and 3c? + 0? are cubes: 

18c = r°, 
3c? + b? = 83, 
where s is odd and 3 divides r. By the same result already quoted, 
s = u? + 9v? with u,v € Z and 


{ b = u(u? — 9v?), 


c = 3v(u? — v’). 


Thus u is odd, v is even (since b is odd), v Z 0, gcd(u, v) = 1. Also, 
2v, u + v, u — v are pairwise relatively prime. From r? = 18c = 


L4. The Cubic Equation 27 


54v(u + v)(u — v) we deduce that (r/3)? = 2v(u + v)(u — v) and 
2v, u+ v, u — v are cubes: 


2v = —n*, 
u+v =l’, 
u—v =-m?. 


Thus D + m? + n? = 0 with l, m, n different from 0, and n even. 
Now we show that |z| > |n]. In fact, 


|z|? = 18|c/(3c° + b?) 
= 5A4lv(u? — v?)|(3c? + b?) 
= 27|n|*|u? — v?|(3c? + 5?) 
> nj. 
Since u? — v? ml Æ 0, |32 + b?| > 1. Again, this contradicts 


the choice of |z| as minimal. 


We shall now justify the step concerning the expression of s as 
s = u? + 3v°. For this purpose, we use arguments, already known to 
Fermat, in connection with the study of integers of the form u?+v?. 

Let S be the set of integers of the form a? + 3b? (a,b € Z). S is 
closed under multiplication, because 


(4.2) (a? + 3b?) (C? + 3d?) = (ac + 3bd)? + 3(ad F bc)? 
(the equality holds with corresponding signs). 
LEMMA 4.1. Let p be a prime different from 2 and 3. Then the 
following conditions are equivalent: 
(1) p=1 (mod 6). 
(2) —3 is a square modulo p. 


3) The polynomial X? + X +1 has a root in F,. 
p 


PROOF. For the equivalence of (1) and (2) we compute the Legendre 
symbol, using Gauss’ reciprocity law: 


EIER CHG 


So (—3/p) = +1 if and only if p = 1 (mod 3), that is, p = 1 (mod 6). 
For the equivalence of (2) and (3), we write 


2 
X?4X41=(X+4) +8. 
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If there exists a € F, such that o? -- a. -- 1 = 0 then -3 = 4 (a — HM 
and conversely, if —3 = 6? with 8 € F,, we take a = —5 + 8/2 so 
o? Fa 1- 0. 


1 
2 


LEMMA 4.2. If k is a nonzero integer, if p is a prime, and p = 
c? --3d? € S, pk = a? -- 3l? € S then p divides ac + 3bd and ad + bc 
(with corresponding signs ) and 


ES 2 
TN (= —) 13 (5) sa 
p p 


PROOF. We have 


— (a? +38?)(c? + 3d?) ` (5 + -i 7 ( ad F bc ) 
CU Meu o Qua d 3d 


by (4.2). But (ac + 3bd)(ac — 3bd) = a?c? — 9b?d? = a?(c? + 3d?) — 
3(a? + 3b*)d? = (a? — 3kd?)(c? + 3d?). Since c? + 3d? = p is a prime 
then, say, p divides ac + 3bd, that is, (ac + 3bd)/p € Z. Hence also 
3 ((ad — bc)/p)" € Z and therefore (ad — bc)/p € Z, thus k € S. 


LEMMA 4.3. If p is a prime then p € S if and only ifp=3 orp=1 
(mod 3). 


PROOF. If p = a? + 3b?, p 4 3, then b 4 0, so p = a? (mod 3), and 
3 Ja; thus p = a? = 1 (mod 3). 

Clearly 3 € S. Let p = 1 (mod 3). Since (—3/p) = 1 then 
there exists t such that 0 < t < p/2 and —3 = t? (mod p). Then 
mp=t?+3< (p/2) +3 < p so 0 «m < p. Now we observe that 
for every t > 1 there exists at most one prime p Æ 2,3 such that 
p|t?+3 but p fu? +3 for every u 1 € u « t. 

Indeed, we assume that there exist distinct primes p, p' as above, 
p < p. By the preceding remark, we must have 0 « t « p/2, and 
t? +3 = pm with 0 < m < p. Since p | t? +3 then p' | m so 
p' € m < p, which is a contradiction. 

Now we are ready to prove the statement. Suppose there exists a 
prime p, p = 1 (mod 3), such that p ¢ S. We take the smallest such 
prime p. Let t > 1 be the smallest integer such that p | t? + 3, so 
0<t< p/2, ? --3 2 mp with 0 < m < p. If p' is any prime dividing 
m, m = p'm', then p' < m < p, so p' € S. From p'(pm’) = pm = t? + 
3 € S it follows from Lemma 4.2 that pm’ € S. If m’ = 1 then p € S, 
as we intended to show. If p" is a prime dividing ml. m = pm, 
then p" € m' « p so p" € S, hence p"(pm") = pm’ € S and by 
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Lemma 4.2, pm" € S, where m" < m’. Repeating this argument, we 
eventually arrive at p € S. 


It is worth giving another proof, using Dirichlet’s pigeon-hole prin- 
ciple, of the fact that if p= 1 (mod 3) then p € S. 

From p = 1 (mod 3) there exists t£, 1 < t < p — 1, such that 
—3 = t? (mod p). We consider the set of all pairs (m, n) such that 
0 € m, n < [yp]; since there are more than p such pairs, then there 
exist two distinct pairs (m, n), (ml, n") such that m+ nt =m’ + n't 
(mod p). So m Æ m and n Æ m', say n > ml, hence t = (m'—m)/(n 
n') (mod p); since 0 < n — n' < Jp and 0 < |m — m| < p, then 
t = ta/b (mod p) with 0 < a < yp, 0 < b < „p. Thus a? + 3b? = 0 
(mod p) and we may write a? + 3b? = kp, with 0 < k « 4. It follows 
that a? = k (mod 3) hence k = 0 or 1 (mod 3), that is, k = 1 or 3. 
If k = 1 then p € S. If k = 3 it follows that 3 | a, so a = 3a’ and 
dividing by 3, p = b? + 3a"? € S. 


LEMMA 4.4. Let m = u? + 3v?, with u,v Æ 0, gcd(u,v) = 1. If p is 
an odd prime dividing m then p € S. 


PROOF. 3 € S, so we may assume that p 4 3. Since p divides m then 
p does not divide v, otherwise it would also divide u, contrary to the 
hypothesis. Let v’ be such that vv’ = 1 (mod p). So (uv)? = — 

(mod p) and (—3/p) = 1, that is, p = 1 (mod 3). By Lemma 4.3, 
pes. 


We complete the above lemmas as follows: 


LEMMA 4.5. If p is a prime, p € S, then its representation in the 
form p = a? + 3b? (with a > 0, b > 0) is unique. 


PROOF. We apply Lemma 4.2 with k = 1, thus p = a?+3b? = c?+3d? 
(where a,c > 0, b > 0, d > 0). Hence 


ch 2 2 
T: (= + 2) E (= =) | 
p p 


so p = ac+3bd, ad = bc. Therefore pd = acd+3bd? = +bc?+3bd? = 
+b(c? + 3d?) = +bp. Hence d = +b thus b = d, hence a = c. 


LEMMA 4.6. Let m = 3 or m= u? -3v?, with u,v 4 0 and gcd(u, v) 
= 1. If m is odd and m = [T4 p; (where p1,... , Pn are primes 
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and e; > 1) then there exist integers a;,b; (i = 1,...,n) such that 
pi = a; + 3b? and 


u +vy-—3 = LG: + b;V—-3)". 
i=1 


PROOF. The proof is by induction on m. It is trivial when m = 3. 
Let m > 3, so m = u? -3v?, with u,v Z 0, gcd(u, v) = 1. Let p bea 
prime dividing m, and m = pk. By Lemma 4.4, p = a? + 30?, and by 
Lemma 4.2, k = c? + 3d? where c = (ua + 3vb)/p, d = (ub F va)/p 
(with corresponding signs). We also have (a + bÿ3)(c F d/—3) = 
(ac + 3bd) + (bc — ad)4/—3 where 


ac + 3bd = -(ua? + 3vab + 3ub? F 3vab) = u, 


1 
+(bc — ad) = = ee + 3ub? — uab + va?) = v, 


that is, 


(a E b/=3)(eF dy/—3) = ut v /3. 


If k = 1, it is trivial. If k Z 1 then either k = 3 or k #3. In 
this case, c # 0 (otherwise c = 0, so d divides u,v, hence d = 1 
and k = 3, contrary to the hypothesis); similarly d 4 0 (otherwise 
d = 0, so c divides u,v hence c = 1 and k = 1, contrary to the 
hypothesis); moreover, gcd(c,d) = 1, because gcd(u,v) = 1. By 
induction, the result is true for k, hence c F d\/—3 is expressible in 
the form indicated. Since (a + bÿ—3)(c+ d/—3) = u + v/—3 then 
the result also holds for m. 


LEMMA 4.7. Let E be the set of all triples (u,v, s) such that s is odd, 
gcd(u, v) = 1 and s? = u? + 3v?. Let F be the set of all pairs (t, w) 
where gcd(t,w) = 1 and t Z w (mod 2). The mapping ®: F — E 
given by (t, w) = (u,v, s) with 

u = t(t? —9w?), 

v = 3w(t? — u?), 

s = Ê + 3w?, 
is onto E. 


PROOF. It is clear that u? + 3v? = s?. Since t,w have different 
parity, then s is odd. Next we show that gcd(u,v) = 1. Indeed, 
first we note that gcd(t? — 9w?, t? — w?) = 1 because if a prime p 
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divides t? — 9w? and t? — w?, it divides 9t? — 9w? so also 8t?, hence 
p = 2 (since p cannot divide t because gcd(t,w) = 1). Since t,w 
have different parity, this is impossible. Now we assume that p is a 
prime, e > 1 and p° divides u and v then p | t or p | t? — Ou? hence 
p | t in both cases; so p / w(t? — w?) hence p = 3. From 3° | v since 
3|¢ then e = 1 thus gcd(u,v) = 1 or 3. If 3 | u, 3 | v then 3 | t, 3 yw 
so 3| s but 3? Y s. However s? = u? + 3v? so 3? | s? hence 3? | s, 
which is a contradiction. This shows that (t, w) = (u,v,s) € E. 

Conversely, given (u,v,s) € E, let s? = TL," be the decom- 
position of s? into a product of primes (pi,... , p, distinct, e; > 1); 
so e; = Ae for every i. By Lemma 4.6 there exist integers a;, b; 
(i — 1,... ,n) such that p; = a? + 3b? and 


u + vV—3 = II. Tby-3)* 


i=1 


Let t,w € Z be defined by the relation 


LG: + b,/—3)*% =t+ wV—3, 
i=l 
so u + v /—3 = (t + wy—3)?. Computing explicitly the cube in the 
right-hand side, it follows that u = t(t? —9w?), v 2 3w(t? — w?). Fi- 
nally, W taking conjugates, u—vV/—3 = (t—w/—3)?, so by multiply- 
ing, s? = u?+3v? = (t?+3w?)3, hence s = t?+3w?. It follows that t, u 
have different parity and also gcd(t, w) = 1, ®(t,w) = (u,v, s). 


In this way we have established all the steps in Euler’s proof of 
(4A). 

Now we prove the following result due to Kronecker (1859); see 
also Vränceanu (1956, 1960). It is a consequence of Fermat’s theorem 
for the exponent 3. 


(4B) 
(1) For every integer m 4 0 the only solutions in integers of the 
equation 4X? — 3mY? = m? are (m,m) and (m, =m). 
(2) The only rational solutions of AU? + 27T? = —1 are (—1, $) 
and (—1,-3). 
(3 XŸ — X + i are the only cubic polynomials with rational co- 
efficients such that the sum of roots is equal to O and the 


discriminant equal to —1. 
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(4) If the discriminant of a cubic polynomial with rational coef- 
ficients is the sixth power of a nonzero rational number, then 
its roots are of the form r + sV3sin(r/9), r+sV/3sin(27/9), 
r — sV3sin(47/9). 


PROOF. (1) Ifx,y are integers and (x, y) satisfies 4x? —3rny? = m? 
then letting u = —2z, v = y +m then u? +v? = —82? + y? + 3y?m + 
3ym? + y? = —2m? — 6my? + y? + 3y?m + 3ym? + y? = (y — my. 
Thus either z — 0 (which would imply —3y? — m?, an absurdity) or 


y = +m; in this case we have necessarily x = m 

(2) Let u,t be rational numbers, such that 4u? + 274? = —1. 
We write u = ain, t = y/3m, so Ar? + 3m = —m?. By (1) w 
have x — m, y = +m, hence u = —1, t= +; 


(3) If X? + aX + b has rational Geer and discriminant 
6 = —1, since 6 = 4o? + 27b? then by (2), a = —1, b = +3. 

(4 If f(X) = X*+aX?+a2X +a; has rational coefficients, if 
g(X) = f (X — a,/3) then g(X), f(X) have the same ege 
and g(X) is of the form g(X) = X°+uX +t with rational coefficients. 

If the discriminant is a sixth power of a nonzero rational number, 
say —(4u + 2742) = rê, then 4(u/2)? + 27(t/r?)” = —1. Hence 
u = ter 3, so 


g(X)= X’? — r°X + ` =r’ IGI (=) + ] : 


'The roots of the polynomials 


X*—-Xr 
are +(2V3/3)sin(r/9), +(2V3/3) sin(21/9), +(2V3/3)sin(4r/9), 


hence those of f(X) are of the form indicated. 


wile 


Conversely, in 1944 Schmid established the equivalence below and 
proved Fermat’s theorem for the exponent 3 by showing directly the 
validity of (2); see also Vranceanu (1956, 1960, 1979) where this fact 
is explicitly spelled out: 


(4C) The following statements are equivalent: 
(1) Fermat’s last theorem is true for the exponent 3. 
(2) For every integer m # 0 the only solutions in integers of the 
equation 4X? — 3mY? = m? are (m,m), (m, =m). 
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TABLE 2. FLT for the exponent 3. 


Author Year 
Kausler 1795/6, publ. in 1802 
Legendre 1823, 1830 
Calzolari 1855 
Lamé 1865 

Tait 1872 
Gunther 1878 
Gambioli 1901 

Krey 1909 
Rychlik 1910 
Stockhaus 1910 
Carmichael 1915 

van der Corput 1915 

Thue 1917 
Duarte 1944 


3) The only rational solutions of AU? + 27T? = —1 are (—1,1), 
(-1,-3) 
zl. 


PROOF. We have seen in (4B) that (1) implies (2) and also that (2) 
implies (3). Now we assume that statement (3) is true and we shall 
derive that Fermat's theorem is true for the exponent 3. 

Assume, on the contrary, that there exist nonzero, pairwise rela- 
tively prime integers x,y,z such that z? + y? = z?, so y Æ z. Let 


u = z/(y—2)t = (y + z)/(3y —z)). Then 4$? + 27t? = —1, 
as is easily seen. So, by assumption, u = —1,t = +1/3, hence 
y —z = +(y + z); this leads to y = 0 or z = 0, contrary to the 
hypothesis. 


In 1885, Perrin showed that if X? + Y? + Z? = 0 has a nontrivial 
solution in nonzero relatively prime integers then it would have an 
infinite number of such solutions, which are obtainable from the 
assumed solution by means of rational operations. Of course this 
statement is not of interest since there are no solutions of the type 
indicated. 

Proofs of Fermat's theorem for the exponent 3 were also published 
by the authors listed in Table 2. 

We conclude this section with the study of an equation similar to 


34 I. Special Cases 


(4.1). With the method of infinite descent, we show (see Legendre, 
1808, 1830): 


(4D) For every m > 0 the equation 
(4.3) X? +Y? Gm 


has only the trivial solutions in integers; namely, the solutions are 
(x,y,z) with xyz = 0 and if m = 1 also (x,x,x) with any x £0. 


PROOF. Assume that x,y,z are nonzero integers, that x,y,z are not 
equal if m = 1, and that z? + y? = 2™z?; we may assume without 
loss of generality that gcd(z, y, z) = gcd(x, z) = gcd(y, z) = 1. 

If m = 3m! with m > 0, (a, y, 2” z) would be a nontrivial solution 
of the equation X? + Y? = Z°, which contradicts (4A). Thus 3 ym. 

Since m Æ 0 then x,y have the same parity. If x,y are not both 
odd, let s > 1 be the largest integer such that 2° divides x,y. We 
write x = Zeil, y = 2*5y' so x’ or y! is odd and x"? + y? = 27-3553; 
thus Lol, y’, z) is a nontrivial solution of an equation of the same type; 
then z', y' have the same parity, so both are odd. Thus, changing m 
into m — 3s, we may assume without loss of generality that (4.4) has 
a nontrivial solution (x,y,z) with x,y odd, and also gcd(z, y, z) = 
gcd(z,z) = ged(y,z) = 1; therefore gcd(x,y) = 1. With the same 
argument, we may also assume that z is odd. So, we have 


22 = £? +y? = (x -- y)(z? — xy+Y), 


with gcd(x +y, z? — zy 4- y?) = 1 or 3. Indeed, if p is a prime, e > 1 
and p° divides both x + y and x? — xy + y? then x = —y (mod p°) 
so z? — zy + y? = 3x? (mod p°); thus p° | 3x7; since ged(z, y) = 1 
then p Yx, hence p? | 3, that is p° = 3, proving the statement. 

Moreover, since x? — zy + y? = (x + y)? — 3xy, then 3 | x + y is 
equivalent to 3 | x? — xy + yz, which is in turn equivalent to 3 | z 
and again to ged(x + y, x? — zy + y?) = 3. 

We are led to two cases. 


Case 1: 3 fz. 
Since gcd(z + y, x? — zy + y?) = 1 and à? — zy + y? is odd there 
exist odd, relatively prime integers a,b such that 
x +y = 2™a3, 
z? -— ry +y =b, 
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with z = ab. Since x + y, x — y are even, we may write 


x+y\° T — d ? 
p 9 
( 2 ) P ( 2 
It follows from Lemma 4.6 that there exist integers t, w such that 


T y = (t + w¥—3)? 


2 
and 
(x +y)/2 HE — 9w?), 
(x — y)/2 = 3x(t? — w°), 
b= P +3w. 

If t = 0 or t? — 9w? = 0 then x = —y so z = 0. If w = 0 or 
t? — w? = 0 then z = y —0o0ry = y = z = +1 with m = 1. 
This was excluded by the hypothesis. It follows that |t? — 9w?| 4 1, 
otherwise t + 3w = +1, t — 3w = +1 which is easily seen to be 
impossible. 


We have 3 Yt, because otherwise 3 | b so 3 | z, contrary to the 
hypothesis. Since b is odd, so are t + 3w and t — 3w. Hence t, t + 
3w, t — 3w are nonzero, pairwise relatively prime integers. From 
2m-1g3 = t(t--3w)(t —3w) it follows that there exist nonzero integers 


c, d, e such that 
bao le. 
t+3w = di, 


t—3w = æ, 


and c,d,e are odd, pairwise relatively prime, with a = cde. Hence 
d? + e? = 2™c? so (d,e,c) is a solution of the given equation. But 
s"-lle = |t| < |t| x |t? — 9w?| = 2"-!|a|? hence |c| < |a|. Also, 
s" lal = |z-Fy| € |x--yl|z? —xy--y^| = |x? -y?| = 2""|z[? so |e] < [2]. 
Since 3 / c, repeating the argument with the solution (d,e,c) this 


would yield a sequence of solutions (d1, €1, c1), (d2, €2, €2),... with 
\z| > al > c| > ---, all the c; being nonzero integers, which is 
impossible. 

Case 2: 3 | z. 


Now gcd(z 4- y, z? — zy-- y?) = 3. We note that 3? / 3? —zy4- y? = 
(x + y)? — 3xy; otherwise 3 | zy so 3 divides both x and y, contrary 
to the hypothesis. 
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Thus, there exist odd, relatively prime integers a,b such that 
r +y = ts 30, 
z?— ry +y = 3b, 
with 3 Ÿb and z = 3ab. Since x+y, x—y are even and 3 | z--y we may 
write 3b? = Saad I/D? + 3((x — y)/2)? hence b = ((x — y)/2) + 
3((x + y)/6)°. It follows from Lemma 4.6 that there exist integers 
t, w such that 


—— n = (t+ wy -3)? 


2 

and 
(x — y)/2 =t(t? —9w^), 
| (a+ y)/6 = 3w(t? — w?), 
b=t?+3u”. 

If t = 0 or °? —9w? = 0 then z = y = 0 or x = y = z = +1 with 
m = 1. If w = 0 or t? — w? = 0 then x = —y so z = 0. This was 
excluded by the hypothesis. It follows that |t? — w?| Æ 1 otherwise 
t+w= 1, t — w = +1, which is easily seen to be impossible. 


Since b is odd, so are t+ 3w, t— 3w, hence also t+w, t—w. There- 
fore w, t — w, t+ w are nonzero pairwise relatively prime integers. 
From 27?-!g? = (a+ y)/18 = w(t — w)(t + w) it follows that there 
exist nonzero integers c, d, e such that 


icm. 
| t+w = d, 
t— w =e, 
and c,d,e are odd, pairwise relatively prime, with a = cde. Hence, 
d? — e? = 2"c#, so (d, —e,c) is a solution of the given equation. 
But 2^?-!|e? = Jl < |w||? — w?| = |£ + y|/18 = 2"-!|a[?, thus 
\c| < |a|. Also 
2"3'|a = |æ + yl 
< tul - EEN 
T 3 
Lei + y?| 
3 
2™ |z|’ 
3 , 


hence |c| < |a] < |z|/3. 
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Whether 3 divides c or not, we repeat the argument in the first 
or second case, and this leads to a sequence of solutions (d4, €1, c1), 
(dz, €2,C2),... with |z| > [al > [cl > ---, all the c; being nonzero 


integers, which is impossible. 


The 


theory of the equation X? + Y? = AZ? has been further 


developed by Legendre (1808, 1830), Pepin (1870, 1875, 1881), Lucas 
(1878, 1880), Sylvester (1856, 1879) and Hurwitz (1917) who proved 
the impossibility of the equation in integers, for many values of A 
— but we shall not enter into this matter. Other interesting papers 
on ternary cubic diophantine equations are from Hurwitz (1917), 
Mordell (1956); see also Mordell (1969). 


1657 


1760 


1770 


1802 


1808 


1823 


1830 
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1.5. The Eisenstein Field 


We shall now give the proof of Gauss that Fermat’s cubic equation 
has only trivial solutions in the Eisenstein field. The set of complex 
numbers a + bÿ—3, where a,b are rational numbers, constitute a 
field, called the Eisenstein field and denoted by K = Q(4/—3). The 
numbers a = (a + b\/—3)/2, where a,b are ordinary integers of the 
same parity, are called the integers of K. They constitute a ring, 
denoted by A. If a,@ € K, B divides a if there exists an integer 
y E€ A such that a = By. We write 8 | a when 8 divides a. Two 
nonzero integers a, are associated if œ divides 6 and B divides 
a; we write a ~ f. The integers associated with 1 are called the 
units of K. It is easily shown that they are +1, C, +¢?, where 
C = (—1 + vV=3)/2, € = (—1 — V—3)/2. We note that C? = 1, that 
is, Ç is a primitive cubic root of 1, and 1 + ¢+¢? = 0. A nonzero 
integer a € A is a prime if it is not a unit and the only integers 
dividing o are units or associated with a. 

In the particular field Q(4/—3) under consideration, it is true 
that every nonzero integer o is the product of prime integers: a = 
^h^ófa:::"y,. This decomposition is unique, in the following sense: if 
we also have a = 64165:--6,, where each 6; is a prime of K, then 
s = t and, changing the order if necessary, y; and 6; are associated 
(for every i = 1,...,s). Therefore, we may define, in the obvious 
way, the greatest common divisor of nonzero integers of A which is 
unique up to units of A. 

The proofs of the following properties may be found in any stan- 
dard text on algebraic numbers, for example, in Ribenboim (1999). 
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The conjugate of a = (a+ bJ/—3)/2 is à = (a — bÿ—3)/2. The 
norm of a is N(a) = ou = (a? — 3b?)/4. 

If o € A then Aa = {Ga | 8 € A} is the ideal of multiples of a. If 
a, 0,7 € A, a #0, we write 


B zy (mod a), 


when a divides 8 — y; we say that 5 and «y are congruent modulo 
a. This is an equivalence relation on the ring À and the set of 
equivalence classes is denoted by A/Aa; the equivalence class of 3 
is denoted by 8 and called the residue class of 3. We define the 
addition and multiplication of residue classes as follows: 6 +7 = 
B o- y, B-¥ = fy. Then A/Aa is a ring, called the residue ring of A 
modulo a. The residue ring A/Aa is finite; its number of elements 
is equal to [N(a)|. 

Now we describe the decomposition of prime numbers p as prod- 
ucts of prime elements of the ring A. 


(1) p = 3 is ramified, that is, 3 = (—C?)A7, so 3 ~ AT, where 
àA =1-¢ = (3 — yJ-—3)/2, À is a prime element of A. There are 
three residue classes of A modulo A; the set {0,1,—1} is a system of 
representatives of the field A/AA. The norm of À is N(A) = AA = 
(1-—¢)(1-¢7) =1-¢€-@41=3, since 14+¢6¢+C=0. 

(2) p = 2 is inert, that is, 2 is a prime of A. There are four residue 
classes of A modulo 2, that is, A/A2 is the field with four elements; 
the norm of 2 is N(2) = 4. 

(3) If p = 1 (mod 3) then p ~ Ah, where Au, Az are prime ele- 
ments of A which are not associated (A, % À); we say that p splits 
(or is decomposed). Now A/Ap has p? elements and it is the direct 
product of two copies of the field F, with p elements, and 


NX) = NO) =p 


(4) If p = —1 (mod 3) then p is a prime element, that is, p is inert; 
A/ Ap is a field with p? elements, N(p) = p?. 


We shall not need (2), (3), (4) above in Gauss' proof. 
We shall need the following precise congruence: 


LEMMA 5.1. If a € A and À does not divide a then o? = +1 
(mod AT). 
PROOF. Since a Æ 0 (mod A) then a = +1 (mod À). First, we 


assume a = 1 (mod À), so a — 1 = 8A where 8 € A. Then a — 6 = 
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(LEE) = BNA AS ABP De epe? eg KE 
A(B +1) c CA = A(8 — ¢?). Hence o? - 1 = (a—-1)(a—-¢)(a—¢?) = 
A38(B8--1)(8—6?). But 1—¢? = (1+ ¢)A, or ? = 1 (mod À). Hence 
B, G+1, B—C? are in three different classes modulo A, and at least one 
is a multiple of A. Therefore a? = 1 (mod À). If a = —1 (mod À) 
then —a? = (—a)? = 1 (mod åf), so a? = —1 (mod À). 


The following result of Gauss implies (4A): 


(5A) The equation 
(5.1) X? +Y’ +Z? =0 


has no solution in algebraic integers of Q(V—3), all different from 
0. 


PROOF. Assume that £, 7, 0 € A are nonzero and satisfy £?4-1?--0? = 
0. If gcd(£, n, 0) = 6 then €/6, 7/6, 0/6 satisfy the same equation 
and gcd (£/6, 7/6, 0/6) = 1. So we may assume gcd(é, 7,0) = 1 and 
therefore, £, n, 0 are pairwise relatively prime. So A cannot divide 
two of these elements £, 7, 0. We may assume, for example, that 


ALE, ^ n. 
First Case: We assume that AF. 


'Then 
63 
63 (mod A7), 


so 0 = ¿8 + n? + 0° = 1 1 (mod à). The eight combinations 
of signs give +1 or +3. These are congruent to 0 modulo À, since 
+1 are units, +3 are associated with A?, hence not multiples of A7. 


(mod A7), 
(mod A7), 


TT 
= m= M 


Ki 


Second Case: We assume that A | 0. 
Let 0 = A", Y E A, m > 1, and A does not divide v. The essen- 
tial part of the proof consists in establishing the following assertion: 
Let n > 1, and let € be a unit of A. If there exist o, face A, 
pairwise relatively prime, not multiples of A, and oli Le A?" = 0, 
then: 
(a) n 2 2; and 
(b) there exist a unit € and a1, 61, yı € A, pairwise relatively 
prime, not multiples of À, such that o2-- 03 4- e; A379 43 = 0. 
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The hypothesis is satisfied with n = m, e = 1,a =E, Bb =n, 7 
w. By repeated application of the above assertion, we would find a 
unit €’, and oi, 8', y! € A not multiples of À, such that o? + 9? + 
e' M4? = 0, and this contradicts (a) above. 

First we show that n > 2. Indeed, À ya and À KO. So by Lemma 
5.1, a = +1 (mod À), 8* = +1 (mod A*) and +1 +1 = Ana? 
(mod àt), A Yy. Since y Ÿ +2 the left-hand side must be 0. From 
À Yy we conclude that 3n > 4, so n > 2. 

Now we prove (b). We have 


(5.2) ena = o? + B? = (at B)(a +CB)(a + C78). 


Since A is a prime element dividing the right-hand side, then it must 
divide one of the factors. But a+ 8 2 «4-68 =a + C (mod À) 
because À = 1— 6, 1— C? = —(?A so À must divide all three factors; 
hence (a + 8)/A, (a +6B)/X, (a+ C?8)/A € A and 


come = (248) (268) (242) 


Since n > 2, A divides the right-hand side, hence at least one 
factor. It cannot divide two of the factors, otherwise two among 
a+, o- CB, a-- C are congruent modulo A. We check that this is 
not possible: (a4-8) —(a-- G8) = 8(1—6) = BA = 0 (mod 4?) implies 
A | 8, a contradiction; (a+ 8) — (a +628) = 8(1— 6?) CA 20 
(mod A?) implies À | B again; (a + 68) — (a + C8) = 68(1— €) = 
¢3X = 0 (mod A?) implies À | 8 again. 

Let us assume that A divides (a+ 8)/A (the other cases are treated 
by replacing 8 by CB or ¢? 6). Then A?^-? divides (a+3)/X. There- 
fore 


(5.3) a + CB = Aka, 


a + C8 = ÀKa, 


with K1, K2, K3 € A, À not dividing #1, K2, ka, Multiplying, we have 


at B= Mn, 


(5.4) — EY = KiKaKa. 


We note that «1, K2, ka are pairwise relatively prime. For exam- 
ple, if 6 € A divides ru, k2, then 6 divides (a + 8) — (a + CB) = 
BU — C) = BA, and similarly when 6 divides k1, «3 (or ro, k3). But 
À does not divide k1, 2,3, so 6 is not associated with A; hence 6 
divides @ and therefore also o, which is a contradiction. 
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By the unique factorization in the ring A, it follows from (5.4) that 
the elements Kı, K2, kg are associated with cubes, i.e., there exist 
units w; € A and elements u; € A such that k; = wiu? (i = 1,2,3). 
So 


(5.5) a + CB = Auw, 
a + CB = Aws. 


We note again that u1, H2, jj are pairwise relatively prime and À 
does not divide 44, 4», u3. Thus 


0 = (a+ 8) + (a +68) + C (ao - CB) 
= A7? ulus + CAuaus + C Ausus, 


| Wi E B = AU uw, 


so 
ua rus + T'A?” u7 0, 
where 7, 7’ are units, H1, H2, H3 € À are not zero, and gcd(j», u3) = 
1. If 7 = 1, we have established (b). If 7 = —1, we replace u3 by 
—p3 and have again shown (b). To complete the proof we show that 
the unit 7 cannot be equal to +Ç or + ¢?. In fact, u$ + ru$ = 0 
(mod A?). Since u3 = +1 (mod A*), u3 = +1 (mod A‘) then u3 + 
TU = +1 +7 = 0 (mod M). However, +1 + ¢ #0 (mod M) and 
143 C 40 (mod X), so 7 C, LC, and the proof of (b) is now 
complete. 
As already explained, this suffices to prove the theorem. 


We take this opportunity to indicate some results similar to (5A) 
that may be proved with the same methods. They may be attributed 
to Euler and Legendre. 


(5B) Let p be a prime, p = 2 or 5 (mod 9). Ife is a unit of K, 
if there exists x € A such that x? =e (mod p), then € = +1. 


PROOF. Assume that £ = +Ç or C? and that there exists x € A such 
that z? = € (mod p). Since p = 2 (mod 3) then p is a prime element 
of A, A/Ap is a field with p? elements, and zë") = 1 (mod p). But 
p?—-1=(p+1)(p—1) = 3(r + 1)(3r +1), where p = 3r + 2. Hence 


2. 
r+1 — go *DGrr1) = y? 1 — 1 


€ (mod p). 


If r =0 or 1 (mod p) then p divides 1+¢ or 1 € C?. Noting that 
1-—¢? = (14+¢)(1—¢) and that 1+¢€ — 1— C, 1- C? = —¢ are units, 
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then p divides 1 — ¢. This would imply that p = 3, which is contrary 
to the hypothesis. Hence r = 2 (mod 3) and p = 8 (mod 9), which 
is again a contradiction. 


Following Mordell, we prove the classical result: 


(5C) Let p be a prime, p = 2 or 5 (mod 9), and let € be a unit of 
Q(¢). The equation 


(5.6) X? Y? + eaZ? — 0, 
with a = p or p°, a # 2, has only the trivial solution (x,y,z) in Q(C), 


namely z = 0, x = —y, or —Cy, or —C?y. If a = 2 then there are 
also the solutions zx? = y? = z? = +1 when € = —1. 


PROOF. To begin we note that if (x,y,z) is a solution with z = 0 
or y = 0 then necessarily z = 0. Indeed, if both x,y are 0 then so 
is z. If, for example, y 4 0 and € Z 0 we may assume gcd(y, z) = 1. 
Then p divides y, so p? | az?, hence p | z, a contradiction. 

Assume now that x,y,z € A are such that x? +y? --£az? = 0, with 
x,y,z #0. We may also assume that gcd(x, y, z) = 1 from which it 
follows that x, y, z are pairwise relatively prime. 

Among all possible solutions, consider one for which the absolute 
value of the norm |N(xyz)| is minimum. Note that x? + y? 4 0 since 
z#0;so x À —y, —-Cy, —Cy. 

Consider the Lagrange resolvents 


a=xr+y, 

B= Cat Cy, 

y= Cat Cy. 
Then a, 6,7 € A, a, 6,7 = 0 and 


aby = CS + C + (1+ day) (Ca + y) 
= CLC + Cy? + (1464 Cry + (1464 Cry) 
= g? + y? = EH, 


Let 6 = gcd(a, 8, y) € A, so gcd(a/6, 3/6, y/6) = 1 and 


Cm AEN a 
Br ig LS 
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Hence a divides one and only one of the factors in the left-hand side, 
say 7/6 (the other cases are similar). By the unique factorization 
theorem, which is valid in the ring A, we have 


a/6 = &e1ai, 
8/6 = £237, 
7/6 = exa, 


where €1,€2,€3 are units of A, and o4,0,,*1 € A, o4, Dim Æ 0, 
gcd(o4, 31, 51) = 1. Hence 


£101 + E203 + Eza} = 0. 
Let &' = €9/€,,€” = €3/€1; hence 
(5.7) aË + eB Leon =0. 


We have p / Bi otherwise p | o1, so p? | an”, hence p | 7 and this 
is impossible because gcd(a1, £1, y1) = 1. Taking the classes modulo 
p, (5.7) yields 

aj +3; =0 (mod p), 
hence €’ is a cube modulo p. By (5B), &' = +1, so 


a + (x4)? + "a? =] 


and (o4, +61, 71) is another nontrivial solution of (5.6), with gcd(a, 
X, ^1) = 1. By assumption, 


NP < Nas = IN (522) =| (5) 
hence |N(ózy)| € 1. This implies that x,y,6 are units. Hence x? = 
+1, y? = +1, and +1 + 1 + caz? = 0. 

If a Æ 2 then z = 0, a contradiction. 

If a = 2, we must have either z = 0, or x? = y? £0, x? +ez = 0; 
then € is a cube modulo p, therefore by (5B), € = +1. But clearly 
€ = +1 would imply z = 0, so € = —1 and z? = y? = 2° = +1. 


3 


H 


As an immediate corollary we have: 
(5D) The equation 


(5.8) X34 4¥% =1 


has no solution in nonzero integers. 
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PROOF. Consider the equation 
(5.9) X’ +T? +4Y? — 0. 


If (x, y) is a nontrivial solution of (5.8) then (x, —1, y) is a nontrivial 
solution of (5.9), which is impossible, by (5C). 


We also note other consequences: 


(5E) The equations 


(5.10) A? —27Y° = 97", 
(5.11) X9? — 16 x 27Y° = Z?, 
(5.12) 16X° —27Y° = Z$, 


have no solutions in nonzero integers. 


PROOF. Assume that x° — 27y? = 22%, with x,y, z nonzero integers. 
Then (z?)? + (—3y?)? — 22% = 0. By (5C), xê = y$ = 2° = +1 and 
therefore x = y = z = 1, which is impossible. 

If zê — 16 x 27y9 = 2%, then multiplying with 2? we have (2x7)? + 
(—22)? — 2(2? x 3y?)? = 0. By (5C), (2x°)° = (—2z)? = (2? x 3?? = 
+1 and therefore 27? = +1, a contradiction. 

Finally, if 16:9 — 27y9 = z? then (3y?)? + z? — 2(2:?)? = 0, so 
by (5C), (3g) = z2? = (207)? = 1, thus 2z? = +1, a contradic- 
tion. 


Furthermore, Legendre showed: 
(5F) The equation 
X? + Y? = 378 


has no solution in integers different from zero. 


PROOF. The proof may be conducted following the same lines, with 
appropriate changes. 


In 1856, Sylvester also announced: 
(5G) The equation 
X? +Y’ + Z? -6XYZ-20 


has no solution in integers different from zero. 
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1.6. The Quintic Equation 


The case n = 5 was first settled by Dirichlet. His paper was read at 
the Academy of Sciences of Paris in 1825, but his proof, published in 
1828, did not consider all the possible cases. Legendre then published 
a complete and independent proof, while Dirichlet was able to settle 
the last remaining case. We reproduce Dirichlet’s proof in modern 
language, using a few facts about the arithmetic of the quadratic field 
K = Q(V5). The proofs may be found, for example, in Ribenboim 
(1999). 

Let A be the ring of integers of Q(V/5). The elements of A are of 
the form (a --bV/5)/2, where a, b are integers of the same parity. The 
invertible elements of A, i.e., the units of K, form a multiplicative 
group. (a+ by/5)/2 is a unit if and only if its norm 


(=) (27 _ a? — Bir 


2 2 4 


is equal to +1 (i.e., a?—5b? = +4). It may be shown that the units of 
A are precisely the elements +((1 + V5)/2)°, where e is any integer. 

An important fact required in the proof is that every element 
of A may be written (up to a unit) in a unique way as a product 
of powers of prime elements. Or, equivalently, every ideal of A is 
cd Among the prime elements of A there are the numbers 
2, V5. 
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We begin by establishing a property concerning certain principal 
ideals of A which are fifth powers. 


(6A) 
(1) Let a,b be nonzero integers such that gcd(a,b) = 1, a Æ b 
(mod 2),5)/a,5 | b. If a? — 50? is the fifth power of an 
element of A then there exist nonzero integers c,d such that 


(6.1) { a = c(c + 50c?d? + 125d"), 


b = 5d(c* + 10c?d? + 5d‘), 
and gcd(c, d) = 1, c#d (mod 2), 5 yc. 
(2) Let a,b be integers such that gcd(a,b) = 1, a,b both odd, 
5 Ya, 5 | b. If (a? —5b?)/4 is the fifth power of an element of 
A, then there exist nonzero integers c, d such that 


(6.2) b = 5d(c* + 10c2d + 5d*)/16, 


and gcd(c, d) = 1, c, d are both odd, 5 Jc. 


{ a = c(c + 50c?d? + 125d*)/16, 


PROOF. If the nonzero integers c, d satisfy (6.1) (respectively, (6.2)) 
then gcd(c,d) = 1,5) c, c,d cannot be both odd, otherwise a,b 
would be both even (respectively, c,d cannot have different parity, 
otherwise 16 would divide c and d, so c,d must be odd). 

Now we prove the existence of c,d, in both cases. 

(1) We first observe that if a4- by/5 = ((h+kv5)/2)> with h =k 
(mod 2), then h,k are even. Indeed, 25b = 5k(h* + 10h?k? + 5k*), 
so 2? divides h* + 10h?k? + 5k*. If h,k are odd then k = +1, +3 
(mod 8), so h? = 1, 9 (mod 16), h* = 1,17 (mod 32) and similarly 
for k, k?, kt. Hence h* + 10h?k? + 5k* is congruent modulo 32 to 
either 


Ss oe. DOS ie. 
3 ok ={ 1 + 90 + 85 = 16, 


or 
17 + 26 +5 = 16, 


2 As 
TRS PR ={ 17 + 234 + 85 = 16, 


which is a contradiction. 

Now we show that gcd(a + by/5, a — bÿ5) = 1. In fact, if a prime 
element a € A divides a + bÿ5 and a — b /5 then a divides 2a and 
2bV5. If a | V5 then a = V5w (w a unit of A); so V5 divides 2a, 5 
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divides 4a? (in Z), and 5 | a, contrary to the hypothesis. Thus a 
divides 2b; since there exist integers s,t such that 2as + 2bt = 2 
then a would divide 2. But 2 is a prime in A, then a = 2w (wa 
unit of A). So 2 divides both a 4- b4/5 and a — b /5, hence 4 divides 
a? — 5b? = (a+ b/5)(a — bv5). But a,b have different parity, so 
a? — 5l? is odd, a contradiction. 

Since gcd(a + bÿ5, a — bÿ5) = 1 and a? — 50? is the fifth power 
of an element of A, it follows from the unique factorization in A 
that a + by/5 is the fifth power of an element, say a + bv5 = ((m + 
nv/5)/2)°, where m =n (mod 2). Hence 


«a (eset (caf) 


2 2 


where t = u (mod 2), (t--uy/5)/2 is a unit of A, and so t?—5u? = +4. 

Let ((m--nV/5)/2)? = (m -- n'//5)/2 so that 16m' = m? (mod 5), 
16n' = 0 (mod 5) and hence 5 | n’. Also 4a = m't + 5n'u, 4b = 
m'u + n't; hence 5 Kan! (otherwise 5 | a, contrary to the hypothesis) 
and therefore 5 Kam. Since 5 | n’, 5| b then 5 | m'uso5|u. If u=0 
then t = +2 and a+ bV5 = +((m+ ny5)/2)?. From the remark at 
the beginning, m,n are even and we put c = +m/2, d = +n/2 and 
it is clear that c, d satisfy the relation (6.1). 

If u Æ 0 then (t + uv5)/2 = +1 and hence (t + uv5)/2 = +((1 + 
V5)/2)° with some exponent e Z 0. Replacing, if necessary, (1 + 
V5) /2 by its inverse, —(1— V/5)/2, we may assume e > 0 and actually 
e > 1 (otherwise u = +1 contrary to the fact that 5 | u). Then 
Be aia (t + uv/5) = (1 am V5}°; therefore 


SEN ZIM boss 


so 2° lu = +e (mod 5) and since 5 | u then 5 | e. Thus e = 5f. Let 


m 4 n5 (55) - c 4 d'/5 


2 2 EE XE 
where d = d (mod 2). Then a + bV5 = +((c + d'V5)/2)°. By the 
remark at the beginning, c', d are even. Let c = +c//2, d = +d'/2. 
Then c, d satisfy relations (6.1). 


(2) The proof in this case is very similar, so we only indicate the 
main steps. First we prove that gcd((a + bV5)/2, (a — bV5)/2 = 1, 
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hence (a + by/5)/2 = ((m + nv/5)/2)?((t + uV5)/2) with m = n 
E 2), t = u (mod 2), t? — 5u? = +4. Then 5 | u. If u = 0 let 
+m, d = +n, so relations (6.2) are satisfied. If u 0 then 
(t + uv5)/2 = +((1 + V5)/2)* with e > 0 (actually e > 1). Then 
e = 5f, and letting 


o, (t5 ( A) - c+dv5 
AL 9 , 


2 2 


it follows that (a+ b/5)/2 = ((c+ d/5)/2)? and again the relations 
(6.2) are satisfied. 


(6B) The equation 
(6.3) X°+Y°+Z7°=0 
has no solution in integers all different from 0. 


PROOF. We assume that there exist nonzero integers x, y, z such that 
x+y? +25 — 0. We may assume that gcd(x,y,z) = 1 and hence 
x,y,z are also pairwise relatively prime. 


First Case: 5 does not divide xyz. 

Then z, y, z are congruent to +1 or +2 (modulo 5). Since z? = x 
(mod 5), y? = y (mod 5), 2° = z (mod 5) thenrz+y+z2=2°+y?+ 
z5 = 0 (mod 5). 

If x,y,z are pairwise incongruent modulo 5 then x + y + z #0 
(mod 5). So, for example, x = y (mod 5). Then —2 = x + y = 2x 
(mod 5). Raising to the fifth power, z? = y? (mod 5°), —2? = 2°2° 
(mod 5?) hence also —25 = x? + y? = 2x? (mod 5?) and therefore 
255? = 2x° (mod 5°), so 2° = 2 (mod 5?), which is not true. This 
proves the proposition in the first case. 


Second Case. 5 | z (for example). 

Then 5 / zy. Since gcd(x, y) = 1 then either x,y are both odd or 
of different parity. 

We first consider the case where x, y are odd. Then —2° = zx? + y? 
is even, so 2,5 divide z. We may write z = 2"5"2', with m > 1,n > 
1, z' not a multiple of 2 or 5. Replacing z by z', we have nonzero 
integers x, y, z, pairwise relatively prime such that 


(6.4) —29mp5n = g5 + y5, 
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with x,y,z odd and not multiples of 5, m > 1, n > 1. Let x + y = 
2p, x — y = 2q (p,q integers not equal to 0) sor =p+q,y=p-Q, 
with gcd(p, al = 1 and p,q not both odd. Then 


= 25" 29 = (p + q)? + (p — q)* = 2p(p* + 10p?q? + 54). 


Since 5 | p or 5 | p* + 10p?q? + 5q* it follows that 5 | p and we write 
p = 5r. So 5 /q and gcd(r, q) = 1, q,r having different parity. Thus 


29 5n25.— 2 x B^r(g* + 500r? -- 128r), 


Let t = qf + 50g?r? + 125r* = (q? + 25r?)? — 5(10r?)?. We put u = 
q? + 25r?, v = 10r?, so u,v are not 0, u is odd, 10 | v, gcd(u, v) = 1. 
Then t is odd, 5 yt, gcd(t,r) = 1, so 5 | r (since 5n > 2). 

Since gcd(2 x Bir, t) = 1 then 2 x 5?r and t are fifth powers of inte- 
gers. But t = u?—5? with u Z v (mod 2), ged(u,v) = 1, 5 Ju, 5 | v. 
By (6A), there exist nonzero integers, c, d such that 


u = c(c* + 50c?d? + 12541), 
v = 5d(c* + 10c?d? + 5d*), 


and gcd(c,d) = 1, c # d (mod 2), 5 / c. From this, it follows that 
5 | d since 5 | r thence 5? | v. We also note that d > 0. Multiplying 
the last relation by 2 x 5? we have (2 x 5’r)? = 2 x 5° x 10r? = 
2 x 5*d(c* + 10c?d? + 5d*) and this number is a fifth power (since 
2 x Dir is a fifth power). 

But gcd(2 x 5*d, ct + 10c?d? + 5d*) = 1 because c + 108? d? + 5d* 
is odd, 5 Ke and gcd(c, d) = 1. Hence 2 x Did and c*+ 10c?d? + 5d* = 
(c? + 5d?)? — 5(2d?)? are fifth powers. Again c? + 5d?, 2d? are not 
both odd, ged(c? +5d?, 2d?) = 1, 5 /c?+5d?, 5 | 24?. By (6A), there 
exist nonzero integers c', d' such that 


c? 5d? = (ct + 50e? d? + 125d’), 
2d? = 5d’(c* + 10c?d? + 5q'^), 
and gcd(c', d') = 1, ds d (mod 2), 5 /c. From this it follows that 
5 | d' because 5? | d?. We also note that d' > 0. Multiplying the last 
relation by 2 x 5° we have 
2? x 58d? = (2 x 54d)? = 2 x 5°d' (c4 + 10^ d^ + 5d"), 


and this number is a fifth power. Since gcd(2 x 5?d', c^ 4: 10e? d? + 
5d'^) = 1 then 2 x 5?d', d'r 10e? d" + 5d” are fifth powers. This is 
analogous to the previous assertion that 2 x 5*d, c* + 10c?d? + 5d* 
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were also fifth powers. Moreover, 0 < d! < d, because 25d” < 
5d'(c'* + 10c7d? + 5d'^) = 2d?, so 0 < d' < 5,/(2d?)/25 < d. If this 
procedure would continue, we would reach an integer d” such that 
0 < d" < 1 and this is absurd. 

It remains to consider the case where x and y are of different 
parity. We omit some details of the computations. 

Let e+ y = p, z — y = q so p,q are odd, gcd(p,q) = 1, 2x = 
p+q, 2y=p—q. Then —2? x 5°%z° = (22)? + (2y) = (p + q)° + 
(p — q)? = 2p(p* + 10p?q? + Beil, Since 5 | p we write p = 5r, so 
5 fa, gcd(q,r) = 1, q,r are both odd, 


RUE 55723 = 2 x bert 


where t = q*+50q?r7+125r* = u? — 5v? with u = q?--25r?, v = 10r?. 
Then u,v are not 0, u,v are even, u = 2 (mod 4), 5 Yt, gcd(t,r) = 
1 so 5 | r (since 5n > 2). We write u = 2u', v = 2v', u’,v’ are 
odd, gcd(u',v) = 1,5 yu, 5 | v. If t' = t/4 = u^ — Bum" then 
t = 0 (mod 4), and —5°"2° = Dirt di, with gcd (5°r, t'/4) = 1. So 
5?r and t'/4 = (u^ — 5v'?)/4 are fifth powers. By (6A) there exist 
nonzero integers c,d such that 


u’ = c(c* + 50c?d? + 125d") /16, 
v' = 5d(c* + 10c?d? + 5d*) /16, 


and gcd(c, d) = 1, c, d are both odd, 5 Fe, Moreover, since 5 | r then 
5? | v' so 5 | d. We note also that d > 0. 
Multiplying the last relation by 5? we have 


(sey sa 


where ((c? + 542) /2)? — Belt = 0 (mod 4). Since the two factors in 
the right-hand side are relatively prime and (5?r)? is a fifth power, 
then 54d and i[((c? + 5d?)/2)? — 5(d?)?] are fifth powers. By (6A), 
there exist nonzero integers c', d' such that 


5*d 
4 


dk = 534/ = 


(c? + 5d?) /2 = d (c4 + 50° d? +125d/*)/16, 
d? = 5d' (df + 10° d? + 5d'*)/16, 


with gcd(c',d') = 1, c,d’ both odd, 5 / c. Moreover, 5 | d and 


1.6. The Quintic Equation 55 


TABLE 3. FLT for the exponent 5. 


Author Case Year 

Gauss both 1863 (posthumous publication) 
Schopis first 1825 

Lebesgue both 1843 

Lamé both 1847 

Gambioli both 1901 and 1903/4 
Werebrusow both 1905 

Mirimanoff first 1909 

Rychlik both 1910 

Hayashi? both 1911 

van der Corput both 1915 

Terjanian both 1987 


d' > 0. Multiplying the last relation by 5° we have 


9, 
55d? = (54d)? = ^s (e^ + 10c?d? + 5d’) 
54 (fei +5d?\" ny 
—. | ; 5(d?)*.. 


Again 5?d' and 1[((c^ -- 5a7)/2)? — 5(d^)?] are fifth powers. This 
is analogous to the previous assertion. Moreover 0 « d' « d, because 
25d? < 16d?. The continuation of this procedure would lead to a 
contradiction. 


In 1912, Plemelj proved the following extension of the preceding 
theorem, see also Nagell (1958): 


(6C) The equation 
X°+Y°4Z°=0 


has only trivial solutions in integers of the number field Q(V5). 


Other proofs of Fermat’s theorem for the exponent 5 are given by 
the authors in Table 3. 


?This proof may be incorrect, according to private communications. 


56 


I. Special Cases 


1825 


1828 


1830 


1843 


1847 


1875 


1901 
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I.7. Fermat's Equation of Degree Seven 


In 1839, Lamé proved Fermat's theorem for the exponent 7. Lebes- 
gue found a simpler proof in 1840. Genocchi devised in 1874 and 
1876 a still simpler proof (using an idea already found in Legendre 
(1830)), which is reproduced in Nagell's book (1951): 


(TA) 
(1) If x,y,z are the roots of a cubic equation with coefficients 
in Q, and x? + y" + z7 = 0 then either xyz = 0 or z,y,z 
are proportional (in a certain order) to the cubic roots of 1, 
namely 1,¢ = (—1 + /-3)/2, ? = (—1 — -3)/2. 
(2) The equation x’ + y* +2" =0 has only the trivial solution in 
integers. 


PROOF. (1) Suppose that x, y, z are the roots of f(X) = X?—pX?+ 
qX — r, with p,q,r € Q. Then 


p-crtytz 
q = TY + Tz + yz, 
r = Dye 
Case I: p = 0. 

We use the identity (see Section II.5) 


(X+Y) -X -Y « 7XY(X - Y)X? + XY + Y?y. 


If r+y+z = 0 and x°+y +27 = 0, then 7zy(z-y)(z?--xy4-y?)? = 0. 
Hence either x = 0, or y = 0, or z = —(x + y) = 0, or xyz Z 0, 
but z? + zy + y? = 0. Therefore (y/x)? + y/x + 1 = 0 and so 
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y/x is a cubic root of 1,y/x #1. Thus y = x¢ (or y = x?) and 
z = —(x+y) = —-x(1+60) = CT, so x, y, z are proportional to 1, C, C? 
(or to 1,¢7,¢). 


Case II: p Æ 0. 
Let k > 1 and sj = x^ +y" + z^, the sum of the kth powers of the 
roots of f(X) = 0. By Newton's formulas: 


$1 =P, 

S2 = 8p — 2q, 

$3 = Sap — S1q + 3T, 

S4 = Sap — S2Q + Sırf, 

S5 = S4p — 834 + S27, 

Sg = S5P — 84d + 837, 

87 = S6P — 85Q + Sal. 
Substituting we obtain 
a ty +2" =p" — 7p°q + Tp r + 14p° — 21p°qr — nai + pr? +7q°r. 
Let m = pq — r € Q. Then 


x +y’ +2" = p! — T(pg — r)(p° — pa q?) + T(pq — ry'p, 
that is, 
a Ey! +2" =p! min — pa tq’) + Tm?p. 
Since x’ +y” +z” = 0 then 
p! —7m(p* — pq - q?) + Tm?p = 0. 
Let q/p? = Q, m/p? = M. Then 
p! — TP M(p* — p*Q — p* Q?) +7p M? = 0, 


SO 


M?—- M(1-Q+Q?)+4=0. 
Since M is a rational number, the discriminant ((1—Q+Q?) /2)?- 
i is the square of a nonzero rational number. Let 2Q—1 = s/t, where 
s,t are relatively prime integers, t > 0. Then Q = (t+ s)/2t, hence 
64¢*|((3t? + s?) /8t?)? — 1] = s* + 6t?s? — t^/7 = u? where u is a 
nonzero rational number. 
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Since 74? is an integer, then u must be an integer, so 7 divides t, 
and therefore 7 Ys. Let t = 7°v, with e > 1, 7 / v. Hence 


(7.1) si E 6 x us? — 2ie ld = 2, 
From (7.1) we deduce that 

(7.2) (s? 4-3 x Tv)? — u? = 64 x 7 lach, 

hence 

(7.3) (8? +3 x 7v? +u)(s? +3 x Pv? — u) = 64 x 77 vf. 


Now we show that gcd(s?+3x 7?*v? +u, s?--3 x ?*v? —u) is a power 
of 2. Indeed, let p Z 2 be any prime such that p | s?--3x 7?*v? +u and 
p|s?-3x *v? — u. Then p | 2u, so p | u. Also p | 2(s? +3 x 7?*v?), 
so p | s? +3 x ?*v?. If p = 7, since e > 1 then 7 = p | s, which is 
a contradiction. If p Æ 7, by (7.3) p | 64 x 7**-!v^, so p | v, hence 
p | s, and this is again a contradiction. Thus the greatest common 
divisor of the two factors in (5.3) is a power of 2. 


Subcase (a): v is odd. 

From (5.3) it follows that 
52 +3 x Tv Lu — 1 Aa 
s +3 x Tw? Eu = BU, 


(7.4) 


where a,b are relatively prime integers, ab = v (so a,b are odd) and 
A, B are even and AB = 64. 
From (7.4) we deduce that 


(7.5) 28? + 6 x Ty? = 74-1 Aat + Bb. 
Noting that if x is odd then x? = 1 (mod 8) and that 7 = —1 
(mod 8) then 
A B A B 
2 a 2e 272 | 74e—1 4 | 4 — 
s = —3 x T a b +7 59 + Gb z-3 SE (mod 8). 


We consider the various possibilities for A, B: 


A=32, B=2 — s?=-—2 (mod 8), impossible, 
A=16, B=4 — s?=-1 (mod 8), impossible, 
A=8, B=8 — s?=-3 (mod 8), impossible, 
A=4, B=16 — s?=83 (mod 8), impossible, 
A=2, B=32 — s?=4 (mod 8). 
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In this last case, (7.5) becomes 
s? +3 kT ek HT a + 160; 
hence multiplying with 64, 
64s? + 6 x 77° x 32a7b? — 64 x 16b* = 74°"! x 64a’, 

SO 

643? — (320? — 8 x T"*a?)* = 7471 x 6407 TE Xa HT, 
and therefore 
(7.6) (8s + 320? — 3 x 7?*a?)(8s — 320? +3 x 77a?) = 7% laf, 


We note that gcd(8s + 320? — 3 x 7?*a?, 8s — 320? +3 x 7?*a?) — 1. 
Indeed, since a is odd, the above numbers are odd. If p | 8s 4- 320? — 
3 x roi and p | 8s — 320? + 3 x ?*a? then p | 16s, so p | s, and 
similarly p | 320? — 3 x 7?*a?. But p | 7**^!a* (by (7.6)). If p = 7 
then p — 7 | s, which is impossible. If p Z 7 then p | a so p | b, again 
a contradiction. This proves the assertion, and therefore 


+ 2 2e,2 _ A 
(7.7) l2 T3x'f*a*-—c 


8s 320? - 3 x 7a? = 74-144, 


where c, d are relatively prime integers, cd = a. Hence c, d are odd. 
From (7.7) we derive the congruence 


T32c* (mod 8), 
which is impossible. We have therefore shown that subcase (a) is 
impossible. 


Subcase (b): v is even. 
Then s is odd (because t is even), hence u is also odd. We write 
(5.2) as 


(s? +3 x Ty)? — y? — 4 x Tf (29)*, 


and noting that both factors of (7.3) are even, and their greatest 
common divisor is 2, we deduce that 


2 2e,2 cl me 4e—1 A4 
(7.8) La tu-2xT*A*, 


s? +3 x Tev um» x Bt, 
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with A, B relatively prime integers, AB = 2v, so either A or B is 
even. Hence 

(7.9) 

s? = —3 x 72°? + 74e E At + B! - —3 x 72° A? B? SE viel 44 + B*. 

If B is even then A is odd and s? = —3v? — 1 (mod 8), hence si = — 


(mod 4), which is impossible. 
If A is even, then B is odd. Let A = 2A, hence from (5.9) 


3? = —3 x 7° A? B? +77! x 24A] + Bt 


and therefore 1 = s? = —3A? + 1 (mod 8); hence 3A? = 0 (mod 8) 
so 4| An, We write A, = 44), hence A = 8A, and (7.9) is rewritten 
as 

s? = —3 x 16 x 77* A2B? +771 x 84 x A5 + B*, 


so 

S—(B^—3x8xT*Ay = 7x8 x A5 — 3? x8 x TAD 
(7.10) = RNA 
This gives 


(7.11)  (s— B? -3x8x 7°A5)(s + B? 3x 8x 7*Ai) 
Ec x 4 x (2A2)*. 


The two factors of (7.11) are even, and it may be seen, as before, 
that their greatest common divisor is 2. Hence 


s F B? 3x8 x TA? = Xe. 
s + B? F3 x8 x 7A? = 7^1 x 2d}, 


(7.12) 


where c», d are relatively prime integers such that cədə = 2A». 
From (7.12) we deduce by subtraction that FB? +3 x 8 x 72° A2 = 
c3 — 7*°-1d5, hence 


(7.13) B? = +6 Meg — c + 7d}. 


Since any nonzero square modulo 7 is congruent to 1,2 or 4, then 
B? +c} #0 (mod 7). So we must have the negative signs in (7.13): 


(7.14) B? = 6 +6 x T Ed — 7 dh. 


This equation is of the same form as (7.1). Moreover, 


AB 
v= CN = 4A,B > 2c5d» > dy. 
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TABLE 4. FLT for the exponent 7. 


Author Cases Year 

Legendre (Sophie Germain) first 1823 (see Chapter II, §3) 
Genocchi both 1864 

Pepin both 1876 

Maillet both 1897 

So, we may proceed by infinite descent and conclude that the 


subcase (b) is also impossible. Thus, case IT is not possible, and this 


proves 


(1). 


(2) If z, y, z are integers such that x’ + y” + 27 = 0, we consider 
the polynomial f(X) = X? — pX? + qX — r, which has the roots 


L,Y, Z. 
TYXZ = 


By (1), since x,y,z cannot be proportional to 1,¢,¢?, then 
0. 


Other proofs of Fermat’s theorem for the exponent 7 are given by 
the authors in Table 4. 


1823 


1839 


1839 


1840 


1840 
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I.8. Other Special Cases 


'There have been numerous papers devoted to the proof of Fermat's 
theorem for special exponents, other than 3, 4, 5, 7. The methods 
used were specific to the exponent in question and in most instances 
not susceptible of generalization. 

We note that according to an oral communication of Terjanian, 
Hayashi's proof (1911) for the second case and exponent 13 has a 
mistake. 

Bréi¢-Kostié studied the equation zi Lu? = zê in 1956 and showed 
that it has no solution in relatively prime integers; however, it has 
nontrivial solutions in integers which are not pairwise relatively prime. 

Now we indicate the elementary proofs of Breusch (1960) for the 
exponents 6 and 10. Of course, the theorem for these exponents 
follows from the truth for the exponents 3 and 5. But the proofs 
which we present will be entirely independent of the above results. 
The following preliminary results, by the method of infinite descent, 
were also proved by Breusch: 
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TABLE 5. FLT for various exponents. 


Author Exponent Case Year 
Kausler 6 both 1806 
Sophie Germain all primes less first 1823 
(Legendre) than 100 (see Chapter IIT) 
Dirichlet 14 both 1832 
Lamé? 11, 17, 23, 29, 41 first 1847 
Matthews 11, 17 first 1885-6 
Tafelmacher* 11, 17, 23, 29 first 1892 
Thue 6 both 1896 
Tafelmacher 6 both 1897 
Lind 6 both 1909 
Mirimanoff 11, 17 first 1909 
Kapferer 6, 10 both 1913 
Swift 6 both 1914 
Kokott 11 first 1915 
Fell 11, 17, 23 first 1943 
Breusch 6, 10 both 1960 
Terjanian 14 both 1974 


(8A) There exist no positive integers x, y, u,v such that 


(8.1) x? + y? Su + v° 
and 
(8.2) zy = 2uv. 


PROOF. Assume the contrary, and among all possible solutions, con- 
sider the one with minimal positive product xy. From this minimal 
choice it follows that no three of the four integers x, y, u,v can have 
a common factor greater than 1. Also gcd(x,y) = 1, because if p is 
any prime dividing x and y then p divides u or v, a contradiction. 
Similarly, gcd(u, v) = 1. Hence one of z, y is even, and the other is 
odd. For example, let 2 | y, so z? + y? is odd, so u or v is even, 
say u is. Therefore from (8.2) we deduce that 4 | y; from z? = 1 


?Lamé's paper (1847) has no proofs. 
4Tafelmacher’s proof (1892) holds only for the first case; see also Dickson, 
History of the Theory of Numbers, Vol. II, pp. 755. 
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(mod 8) it follows that z? 4- y? = 1 (mod 8). Again, by (8.1), u? = 0 
(mod 8), so u = 0 (mod 4) and 8 | y. 
Let y = 2"*ly (with r > 2, y' odd), then u = 2"wv', with u’ odd. 


'Thus 
(8.3) ry’ =u'v and x,y’, u'v are odd, 
` gcd(z, y") = ged(u'o) = 1. 


Moreover, if 
a = gcd(z, u^), b = gcd(z, v), 
c= gcd(y', u’), d = gcd(y', v), 
then a, b, c, d are pairwise relatively prime and b,d are odd, so b? = 
d? = 1 (mod 8). So x = ab, y = cd, w = ac, v = bd. Therefore the 
original equation becomes 
a?b? + 22742022 — 227 G2 2 EE bd. 
Letting t = 2"c, we have 
(8.4) 32 = (a? — d?)(t? — b’), 
where t, a, b, d are pairwise relatively prime and 4 | t. It follows that 
d? divides t? — b? and t? — b? divides 3d. 
'This holds if and only if one of the following conditions is satisfied: 
(D à? — b = 3d?; 
(II) t? — b? = g; 
(III) t? — b? = —3d?; and 
(IV) t? — b? = —d?. 
But t? = 0 (mod 8) and b? = d? = 1 (mod 8). Therefore the cases 


I, II, III are not possible. From t? — b? = —d? it follows that 3t? = 
d? — a?, so 


(8.5) b? — d? E =a? AC. 


By (1A) there exist integers m, n > 0 and integers r,s > 0 such 
that 


b=m +n’, 
Co) { t = 2mn, 
and 
b=r?+8?, 
(8.7) { 2t = 2rs. 
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From these, we obtain the relations 


(8.8) rts =m +n 
and 
(8.9) rs = 2mn. 


Since rs = t = 2"c < 2"y' < y € xy, we have found a new solution 
for the original system, contradicting the minimality of the solution 


(35, Y, u, v). 


(8B) There exist no positive integers x,y,u,v such that 


(8.10) ry -uw y 
and 
(8.11) zy = 2uv. 


PRoor. We assume the contrary and consider positive integers sat- 
isfying the above relations and such that ry is minimal. 

Proceeding as before, one of x,y is even and the other is odd. 
Thus z? — y? = u? + s? = 1 (mod 4); therefore necessarily x is odd 
and y is even. As in the preceding proof, we arrive at equations 
(8.3), (8.4), (8.5), and (8.6) with t = 2^c,r > 2. And we obtain the 
relation a?b? — 2?n*? 2g? = 2?rg? c? + di. hence 


(8.12) 5t?^d? = (a? — d’)(b? — t?). 

So d? divides b? — t? and b? — t? divides 5d?. By the same reasoning 
as previously, taking congruences modulo 8, we see that b? — t? = d?. 
It follows that 51? = a? — d?, hence 

(8.13) Hd 0 and b? + 4t? = a°. 


By (1A) there exist integers m,n > 0 and integers r,s > 0 such 
that 


-— 2 2 
(8.14) TE TNS 


t = 2mn, 


and 


b =r? — s?, 
SES { 2t = 2rs. 
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We obtain the relations 


(8.16) r?— s =m +n 
and 
(8.17) rs — 2mn. 


Since rs « ry, we have again reached a contradiction to the mini- 
mality of the solution (z, y, u, v). 


Using these facts, Breusch proved: 
(8C) There exist no positive integers x,y,z such that x9 4-y$ = 2°. 


PROOF. Assume on the contrary that x,y,z are pairwise relatively 
prime positive integers such that r9 + yê = 26. Then 3 does not 
divide both x and y, so we may assume that 3 yx. We have 


zê = 29 — yf = (z + y)(2 — y) + zy yy - zy y). 


The last two factors in the above product must be odd, since x, y are 
not both even. Moreover, it is easily seen that each factor z?+zy +y? 
and 2? — zy + y? is relatively prime to the three other factors of the 
right-hand side (because 3 / x). Hence by the unique factorization 
of integers, 


{ z+ zyty? =D, 
—zy+y =, 


with integers b > c > 0. Adding and subtracting these relations, we 
have 2(2? + y?) = H + c6 and 2zy = b? — cê. But b,c are odd, so 
D? +c? = 2m, D? — e = 2n (with m > 0,n > 0), so squaring and 
adding, we get b? -- c = 2(m?+ n?), and multiplying, b? — cê = 4mn. 
Hence 2? + y? = m? + n?, zy = 2mn. According to (8A), this is 
impossible. 


(8D) There exist no positive integers x,y,z such that 


r1 yl = z", 
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PROOF. Let x,y,z be positive, pairwise relatively prime integers 
such that zi + y!? = 310, Then 5 does not divide both x and y, 
so we may assume 5 Yx. We have 
gei —y 
= (zy - y) + ey + 27y? tr sai ra) 
x (24 — zy + zy — zy). 


4 


The last two factors in the above product must be odd, since y, z are 
not both even. Moreover, it is easily seen that each of the last two 
factors is relatively prime to the three other factors on the right-hand 
side (since 5 /z). Hence by the unique factorization of integers, 


24 + zy WR zy? + zy? + dp — bo. 
24 — By + zy — zy + yt cd, 


with integers b > c > 0. Adding and subtracting these relations, we 
have 2(z* + z2y? + yt) = b + c9, 2(z3y + zy?) = b — cl. But 
b,c are odd, so b +c? = 2m, D? — c? = 2n (with m > 0,n > 0), so 
squaring and adding, we get NI 2 c!? = 2(m? + n?), and multiplying, 
bt? — cl = Ann. Hence 
z^ HYHY mR, 
{ zy(2°? + y?) = 2mn. 


Now letting 2? + y? = r, zy = s, then r? — s? = m? + m?, rs = 2mn. 
This is impossible by (8B). 


Fermat’s theorem for the exponent 14 was established by Dirichlet 
(1832) before Lamé settled the case of exponent 7. Dirichlet also 
showed: 

(8E) The equation 

xa =. y = omzttn 714 
(with m > 0,n > 0) has no solution in nonzero integers x,y,z with 
x,y relatively prime. 


Terjanian proved in 1974: 


(SF) Ifa is a natural number, a 4 0, multiple of 7, and without 
prime factor p = 1 (mod 7), if x,y,z are natural numbers, x,y # 0, 
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x,y relatively prime, if 


then x = y = 1 andz=0. 


From this, he deduced a simple proof of Fermat’s theorem for the 
exponent 14. 

In 1885, Matthews gave a proof for the first case of Fermat’s the- 
orem for the exponents 11 and 17. In 1948, Fell indicated a distinct 
proof for 11 and claimed that his method also solved the first case 
for the exponents 17 and 23. 
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1.9. Appendix 
Would this Be Fermat’s “Marvelous Proof" ?® 


We can see in Euler’s Opera Posthuma Mathematica et Physica, 
Petropoli, 1862, Vol. 1, pp. 231-232, in “Fragmenta arithmetica 
ex adversariis mathematicis deprompta” the following argument, at- 
tributed to A.J. Lexell. It represents an attempt to apply the method 
of infinite descent to Fermat’s equation, in a way which could have 
been Fermat’s. 

The attempt failed. 

Assume that FLT is false for some exponent n > 2. We may 
assume n to be an odd prime (in the argument we only require n 
to be odd) and that there exist nonzero integers (not necessarily 
positive) a,b,c, such that a” + b" = c" and c is even, a and b are 
odd, a Æ b, and gcd(a, b,c) = Let 


Der 

DT EN = b”) /2, 

2S abe" ii 
SO x,y,z are integers, x is even. Then 


Luz a", 
r—y-b, 


4x? c? x ` 
It follows that 


x 
= gr ty? = (g-Vi2y)?, 


Let d = gcd(z, z), so d = c”~*/2 because gcd(ab, c) = 1. Let x = dx’, 
z = dz’, so x’ is even, gcd(a’, 2’) = 1 and d"*!z'(x" — 42") is a 
square, hence so is 217 — 4z'"), with the two factors relatively 
prime. Thus there exist integers r, s such that 


hence 


5I am indebted to E. Bombieri who called my attention to what follows. 
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So 
r?” — 3? = (r^ + s)(r" — s) = 42”. 
Since r = s (mod 2), then gcd(r" + s, r^ — s) = 2, so 
r” + s= 20?, 
mi — s = 2u”, 
and adding, we obtain r” = t” +u”, which gives a nontrivial solution 


of Fermat’s equation, with r even. If we would have r « c then by 
descent we would reach a contradiction. However, 


so r = c and the descent method is not applicable. 


II 
4 Interludes 


In this chapter we discuss topics which will be required in the sub- 
sequent developments. Their importance is not restricted to their 
applications to Fermat’s last theorem. 


IL1. p-Adic Valuations 


Let p be a prime, let a be a nonzero integer, and let v,(a) be the 
exponent of p in the factorization of a as a product of prime-powers: 


a = p" ?)b, where p fb, 


v,(a) is the p-adic value of a. By convention we also set v,(0) = oo. 
We note: 


vy(ab) = vp(a) + v, (b); 
(1.1) vp(a +b) > min(v,(a), vy(b)); 
if v,(a) < v,(b1), V)(b2),... , vy (bx) 
then vp(a + bı + ba 4- +--+ by) = v,(a). 


If v,(a) = e > 1 we say that p° is the exact power of p dividing a 
and we write p° || a. 
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More generally, if gcd(a, a^) = 1, a’ > 0, we define v, (a/a’) = 
vy(a) — v,(a’). Then for any rational numbers r,s, the above prop- 
erties (1.1) are still satisfied. The mapping v, : Q — ZU {oo} is the 
p-adic valuation of Q. 

We say that r = a/a' is p-integral if v,(r) > 0. Clearly every 
integer is p-integral (for every prime p). The set Z, of p-integral 
rational numbers is a subring of the field Q of rational numbers. 
Explicitly, r € Z, if and only if r = 0 or r = p*a/b with k > 0, b > 
0, gcd(a, b) =1, p fa, p fb. 

If r,s c Q, e > 1, we write r = s (mod p°) when v,(r — s) > e; 
we also say that p* divides r — s (with respect to Zp). This relation 
of congruence satisfies the ordinary properties of the congruence of 
integers modulo a natural number. 

It is also clear that a rational number r is in Z if and only if r is 
p- integral for every prime p. 

The following two results have numerous applications. For any 
real numbers x let [x] denote the unique integer such that [x] < x < 
[x] + 1. [x] is called the integral part of x. 

In 1808, Legendre determined the exact power p"' of the prime p 
that divides a factorial a! (so p™*! does not divide a!). There is a 
very nice expression of m in terms of the p-adic development of a: 


a = akp” + ay ip". +--+ + ap + ao, 
where pë < a < p**! and 0 < a; < p—1 (for i = 0,1,... ,k). The 
integers ao, a1, ... , aj are the digits of a in base p. 


For example, in base 5, we have 328 = 2 x 5? + 3 x 5? +3, so the 
digits of 328 in base 5 are 2, 3, 0, 3. Using the above notation: 


(LA) JIfa>1 then v,(a!) 2 m where 


o fa] a-(aod ai: as) 
kl) p | 


i=l 


PROOF. By definition a! = p"b, where p Yb. Let a = qıp + rı with 
0<m,0< 1" < p; so q = [a/p]. The multiples of p, not bigger 
than a, are p,2p,... ,qp € a. So p? (qi!) = pv’, where p Yb'. Thus 
qı +m, = m, where p™' is the exact power of p which divides ol. 
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Since qı < a, by induction, 


but 


as may be easily verified. So 


alae 


Now we derive the second expression, involving the p-adic digits 
of a = app" + --- + aip + ag. Then 


a 
B = agp’ +... + a, 


a 
E = agp’? +--+ + a, 


Sis] = a, +az(p + 1) +a3(p? +p+1)4+-:: 


+ (pi +p? o pl) 


x np — 1) + @(p° e +--+ +ax(p* — 1)} 
p 


1 
= seq eee 


In 1852, Kummer used Legendre’s result to determine the exact 
power p™ of p dividing a binomial coefficient 


a+b\ (a+b)! 
Geo ab’ 


where a > 1,b > 1. 
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(1B) The exact power of p dividing Kaes is equal to Eo+E1 +: "+64, 


which is the number of “carry-overs” when performing the addition 
of a,b written in base p. 


PROOF. Let 
a = a+ ap + ap", 
b = bo bip hip’, 


where 0 < a; < p— 1,0 € b; € p — 1, and either a; Z 0 or b, Z 0. 
Let S, = 3s Qj, Sp = ee b; be the sums of the p-adic digits of 
a,b. Let c;, 0 € cj € p — 1, and e; = 0 or 1, be defined successively 
as follows: 


ao + bo €oP + Co, 
Eo ai b = epe, 
E1 + a2 + b2 = E2p + ©, 


Et-1 + at + bt = Ep +. 


Multiplying these equations successively by 1, p, p’, ... and adding 
them, 


a+b eop + erp? eap! = ep H Ep H ep 
+ co teypt Leon, 


So, a+b = co d- ep ep! + ep! , and this is the expression 
of a+ b in the base p. Similarly, by adding those equations, 


Sa + S + (Eo + €1-F H 6a) = (Eo & b Ep + Sato — Et 
By Legendre's result, 
(p— Um = (a+b) Bank S,—6 4 S, 
= (p— 1)(eo c & t: t &). 


Hence, the result of Kummer. 


This theorem of Kummer was rediscovered by Lucas in 1878. 
In 1991, Frasnay extended the result replacing integers by p-adic 
integers.! 


l'This result is apparently still unpublished; a preprint was given to the 
author. 


II.2. Cyclotomic Polynomials CT 


The results of Legendre and Kummer have found many applica- 
tions in the so-called p-adic Analysis. 
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II.2. Cyclotomic Polynomials 


Let n > 1 and G, = cos(27/n) + isin(2n/n). So G = 1, @ = —1, 
C3 = (=1 T iv3)/2, Ga = t, G5 = cos 72° + isin 12°, Go = (1 ON iv3)/2, 
etc.... 

All the powers of ¢, are also nth roots of 1, so they are roots of the 
polynomial X” — 1. If w is any nth root of 1, the smallest d > 1 such 
that w = 1 is called the order of the root of unity w and necessarily 
d divides n; we say then that w is a primitive dth root of q. 

The powers Qj for j = 1, 2, ... , n are all distinct with ¢” = 1; 
so Cn is a primitive root of unity of order n. Since there are n nth 
roots of 1, every nth-root of 1 is a power (7. Moreover, as is easily 
seen, ¢/ is a primitive nth root of 1 if and only if gcd(j, n) = 1. Thus 
the number of primitive nth roots of 1 is equal to y(n), where y(n) 
denotes the totient of n and q is Euler's function. 

The nth cyclotomic polynomial is 


(2.1) ®(X)= [[ N-O 


gcd(j,n)=1 


(product for all j, 1 € j < n, gcd(j, n) = 1). It is a monic polyno- 
mial of degree y(n). Since the polynomial remains invariant by the 
permutation of its roots, from Galois theory the coefficients are in 
Z. 
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By grouping the nth roots of 1 according to their order, we obtain 


(2.2) X" 1-2 [[$400. 
d 


From this, or directly, we obtain 


(23) ®,(X)=XP +X”? +.. X +1, 

XP -1 

An" 

— KN "(eu + H 0-2) quce s +1, 


(24) ®,(X)= 


for any prime p and e > 1. 
If m | n and m Z n we have therefore 


(2.5) X” —1 = (X” - 1)&,(X) [[94(X) 


(product for all d, 1 <d<n,d|n,d/m). Let u be the Möbius 
function 


. f Oif the primes p; are not distinct, 
e p= { (—1)" otherwise. 


Then 


(2.7) (X) = ][(xr/4 - 1)". 
din 
We note the following properties: If p is a prime and p divides m 
then 
(2.8) $,,(X) = $,(X") (when p | m). 
If p does not divide m and s > 1 then 


@,,,(X” ) 
(2.9) Spm X) = A. O (when p Im, S a 1). 
We now consider the corresponding homogenized polynomials in 
two indeterminates. Let 


x 
(2.10) ®, (X,Y) =V°Ms,, EN 
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then 
(2.11) X” — Y” = | [ (X,Y). 


dn 
If m | n and m £n then 


(2.12) X" — Y” = (x"—yn)e,(x, Y) [[$4(X.v) 


d 
(product for all d, 1 € d « n, d | n, d / m). We have also 
(d) 
(2.13) $,(x, Y) = IL (x - vy", 
d|n 
and, as before, 


(2.14) Pom X, Y) = En(X?,Y?) (when p | m), 


D(X? yt) 


2.15) ®ysm(X,Y) = a =A 
( ) p ( ) $,, (X» xy ) 


(when p /m, s > 1). 


IL.3. Factors of Binomials 


Let a,b be nonzero distinct integers. In this section we consider 
binomials a” + b" as well as the integers (a" — b")/(a — b) and we 
discuss their factors. 


(3A) | Leta,b be nonzero distinct integers. 
(1) If p Z 2, p fab, and v,(a — b) =e > 1, then v, (a? — 6") = 
e+r for every r > 1. 
(2) If 2 Yab and v; (a — b) = e > 2, then vo (a? — b”) =e+r for 
every r > 1. 
(3) If p is any prime and p | a? — b, then y? | a? — b. 


PROOF. (1) It suffices to show that v,(a? — bP) = e+ 1, and then 
repeat the argument. By hypothesis, a = b + kp*, where p Yk. Then 


a? = bP + H b? pt + H bP? k?p E KP pre, 
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Since p divides (7) for j = 1,...,p — 1, then 


Up (") ie 21-4 je. 


From v,(k?p?°) = pe, it follows that v,(a? — P) =e +1. 

(2) As in (1), it suffices to show that vo(a? — 0?) = e +1. By 
hypothesis, a = b + 2°k with e > 2 and a,b,k odd. Then a? = 
bD? + 2**1k + 2?*k?: since e+ 1 < 2e, then w(a? — b) = e +1. 

(3) By hypothesis, a = a? = b? = b (mod p); raising to the pth 
power, a? = b? (mod p°), so p? | a? — b. 


If n > 1 and a,b are distinct nonzero integers, let 


ge cht, 3 


Kë akon ik, 


a—b ee? 


(3.1) 0, (0,0) = 


By convention we define Qola, b) = 0. We note the following expres- 
sion for Q,(a,b) (n > 1): 
[(a — b) + b|” — b” 

a—b 


(3.2) Qua b) = 


«( i Je — b)b"-? + npr} 
n — 2 


= (a — b)e + nb", 
where e € Z. Also, if n — p is a prime number, then 


(3.3) Q,(a, b) = (a — b)" + pf, 


where f € Z. 

We shall now indicate some properties of the integers Q, (a, b). 
Jacquemet (before 1729) proved (5) below when n — p is an odd 
prime. Euler proved in 1738: if p is an odd prime, a > 1, then 
gcd (Q (a, x1), a +1) = 1 or p; if p divides a + 1 then v,(Q,(a, £1)) 
= 1; moreover, if p Æ 3, a # 2, then Q,(a,+1) is odd and greater 
than 1. 

In 1769 Lagrange proved (4) below, as well as (6) when n = pisa 
prime number. In 1837, Kummer proved (4) when n — p is a prime 
number. In 1888, Sylvester proved a special case of (3). In 1897, F. 
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Lucas proved special cases of (5) and (6). More recently, properties 
(4), (5), (6), and (7) have been established by Inkeri (1946) and 
Vivanti (1947). In the present form, most of the proposition is in 
Moller’s paper (1955). 


(3B) Let n > m > 1, and let a,b be nonzero distinct integers. 
Then 


(1) If n= mq+r with r > 0 then 


Qn(a, b) = a"Q,(a™, Bb”) Qn (a, b) + "1 Q,(a, b). 
If n 2 mq — r with r > 0 then 


Q, (a, b) = Ig "Qua", p") + p^] Qm(a, b) 
= GO morcs b). 


Assuming a,b relatively prime, we have: 


(2) If d = gcd(n, m) then Qala, b) = gcd (Q,(a, b), Qn(a, b)). 
3) [pin Qp(a, b) divides Qn (a, b). 
) gcd(Q, (a, b), a — b) = gcd(n, a — b). 
) If p|a—b, p fn then p /Q,,(a, b). 
) If p is an odd prime dividing a —b then v, (Q, (a, b)) = vy (n). 
) If 4|a— 6 then v (Q.(a, b)) = v2(n). 
If 2|a—b but 4 f a — b then v (Qn(a, b)) > vo(n). 
(8) If n is odd then Q, (a,b) is odd. 
(9) If n is odd and e > 0 then gcd(Q, (a, b), a?" + 57") — 1. 
(10) If every prime factor of n divides a — b then n(a — b) divides 
a” — b^. 


( 
(4 
(5 
(6 
(7 


PROOF. (1) Let n=mq+r. Then 


amat" = pmatr 


Qn(a,b) = ES? 
EE 
i a—b 
p amab arb 2) 
= a x Pm Lr x pna 
c da a d de M eae 
am — pm a—b a—b 


a Qula”, 6”) Qm(a, b) + Q.(a, b)b™. 
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Now let n = mq — r. Then 


la TO aam. b™) + pee Qm(a, b) _ ü "B RO as b) 


aE E A E ed 
= a 
a REM bm a — b a — b 
a” — b" 
—gm- rpm 
a—b 
8 a" — qm-rp(a-1)m + ambn-m mae bn SS a™pr-™ + gmt pn-mtr 
= a—b 
a" — b^ 
= = Q,,(a, b). 
a — b Q ( H ) 


(2) Since a, bare relatively prime then for every k > 1, a and b are 
also relatively prime to Q;(a,b) = a*-!+a*-?b+---+ abt? + dF}. 
By (1), Qa(a, b) divides Q,,(a,b) and Q,,(a,b). Let r,s be positive 
integers such that d = sm — rn (or d = rn — sm). So sm = rn + d 
and by (1), 


Q.(a", 0" )Qm (a, b) = Qsmla, b) = a^Q, (a^, 0")Q, (a, b)+b"" Qala, b). 


If t divides Q,,(a,b) and Q,(a,b) then t divides b” Qala, b); but 
Q,, (a,b) and b are relatively prime, hence t | Qala, b), showing the 
statement. 

(3) By (2), the integers Q,(a, b) (for primes p dividing n) are 
pairwise relatively prime. By (1) if p|n then Q, (a, b) divides Q, (a, b). 
Hence TL, Q,(a, b) divides Q, (a, b). 

(4) Since gcd(a, b) = 1, it follows from (3.2) that gcd(Q,,(a, 5), a— 
b) = gcd(n, a — b). 

(5) This is an obvious consequence of (4). 

(6) Let n = p'm, p fm, r > 0 so v,(n) = r. Let a = a", bı = 
bm. Since p | a — b, by (5) p does not divide Q,,(a,b) = (a" — 
0")/(a— b) = (a, —51)/(a — b), so v; (a; — 61) = v,(a— b) > 1. Hence 
vfa" — M) = v,(a, — bı) +r. Thus 


Up (Qn(a, 0)) = (ET) + H - al =r = v(n). 


a, — bı 


(T) Let n = 2m, 2 / m, r 2 0, so v(n) = r. Let ay = a”, b = 
b". As in (6), v(a1 — b) = v(a — b) =e > 1. If e > 2 then 
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However, if e = 1 then v (a? — b?) > r+ 1 (and it may be greater 
than r + 1), so we can only conclude that vo (Q,(a, b)) > ve(n). 

(8) Ifa#b (mod 2) then a” z b” (mod 2) so Q,(a,b) is odd. 
If a = b (mod 2) since a,b are relatively prime, they are odd. It 
follows that Q,(a, b) = a"! + a" ?b+...+ ab^? +6" is the sum 
of an odd number of odd summands, so it is odd. 

(9) Let p be a prime, r > 1, and p^ | Q,,(a,b), p" | a” +b”. By 
(7), p #2. Since p^ | a” — b” then a?” = b?” (mod p") so p^ | 2a?" 
Therefore p divides a, hence also b, which is not possible. 

(10) Let p be any prime factor of n; by hypothesis p | a — b, 
hence by (6) or (7), v,(n) € v, (Qn(a, b)), therefore v,(n(a — b)) < 
Up(Qn(a, b)(a — b)) = v(a” — b”). Since p is arbitrary, this shows 
that n(a — b) divides a” — b^. 


We indicate now the following complementary result proved by 
Inkeri in 1946: 


(3C) Let p be an odd prime, n > 1, and let a,b be nonzero 


relatively prime integers such Se a x b. Then: 


(1) Qp» (a,b) = Mrs 1 Qla” > py 
(2) If p does not divide a at then the integers a — b, Q,(a, b), 


Q,(a^,bP),...,Q,(a" , be^ 7) are pairwise relatively prime. 
(3) If p | a" — b”, if i,j are integers such that 1 <i<j<n 
then 


ged (Q, (o ^, v) Qu (ar) = p 
ged (a =O; (a" E wr) = p. 


(4) Ifv, (or — b") =e>1 thene >n+1 and v,(a—b) =e-n. 


PROOF. (1) 


er = Bän Olai" y"). 


moi m=1 
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(2) We have a?” — bP” = qoqi -+ qn where qo = a — b, 


a” — bP 


j-1 jpi-1 
qj = qp! — pei T Qp (a" T4 ) i 


By hypothesis, p / q; (for j = 0,1,...,n). If is a prime and I 
divides q;, q; (with 0 € i < j < n), then l Z p and l | a” — b" . Since 


j j 
p! p! a? == bP 
Qpi-i a” , b = ar — p» — pr = Qix1Qi-2'* dj, 


then | divides Q,;-:(a" , br). By (3B)(4), we have | = p, a contra- 
diction. 

(3) Assume that l is a prime, e > 1, and l° divides q; and 
q; (0€ i « j € n). Then l° divides a” — b and also a" — 
p = (a? — bh )qiu : qj-1. Since l° divides Qla”, bP) = qj, 
by (3B) (4) l° divides p, so l° = p. 

(4) If p divides a?” — b?" then p Ya, p yb. From 


n—1 


a= (ae) = (wy = p" ` (mod p), 


n—1 H 


it follows that p | a" — 6?" '. By (3B)(6), v,(Q,(a"" ', b?" )) 
= 1, hence pat — i?" ') = e — 1, with e — 1 > 1. Repeating this 
argument, u(a — b) 2 e— n withe > n4 1. 


We shall need later the following estimates: 
(3D) Letn be odd, n > 3, and let a,b be nonzero distinct integers 


such that a -- b > 1. Then Q,(a,b) > n. The equality holds exactly 
when n = 3, a = 2, b = —1 orn = 3, a = —1, b=2. 


PROOF. Since Q,,(a,b) = Qn(b,a) and a Z b, we may assume with- 
out loss of generality that a > b. If b > 1 then 


Qala, b) =a™*+a"7b+---+ab™? 4671 >1414---+1l=n. 
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If b< —1 then a > 1—b > 2, b? > 1 and 


Q.(a,b) = (a+ b)(a? + atb? -E --- ab^?) + b^! 
> n ebe Et 


= (40-97 p Ais? A 1) +1 


4(n-1)/2 =f 
= 2. F1-2(271-1)41 


= >n 


when n > 3. If we have Q, (a,b) = n then we must have equalities all 
through, and this requires that n = 3, a = 2, b = —1. By symmetry, 
we may have Q,,(a, b) = n also when n = 3, a = —1, b = 2 


We note explicitly that if p is an odd prime, if a > b > 1 then 
Q (a,b) > p. This fact is a special case of the above result, but may 
also be seen directly, since Q,(a,b) = a?^! + a??b + --- + ab? 4 
pP >p. 

Before proceeding, we note: If 1 € b < a, by (2.13), 9, (a,b) > 0. 
If n is odd, n > 3, and a,b are not both even, then ®,(a, b) is 
odd. Also, if n is odd, n > 3, and a,b are not both even, then 
®,,,(a, b) = ®,,(—a, b) is odd. 

Let a,b be nonzero distinct relatively prime integers, let n > 1. 
We say that the prime p is a primitive factor of a” — b" (respectively, 
a” + b") if p divides a” — b" (respectively, a” + b”) but p does not 
divide a” — b™ (respectively, a” + b") for every m, 1 € m < n. We 
first note that if 2 is a primitive factor of a" +b” then n = 1, because 
a = b (mod 2). 

Next we observe: 


(3E)  Ifn 22 then p is a primitive factor of a” + b" if and only 
if p is a primitive factor of a?" — in. 


PROOF. If p is a primitive factor of a?" — 0?" then p Y a” — b” so 
p | a” +b”; moreover if p | a^ +b" where 1 < k < n then p | a?* —b?*, 
so 2k = 2n, showing that p is a primitive factor of a” + b". 
Conversely, if p is a primitive factor of a" + b" (with n > 2) then 
p x 2 and p | a?” en If 1 € k « 2n and p | a* — b*, let k = 2°m, 
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with e > 1 and m odd. Then 
aë — b = (am — pm (com + pm : 


Since 2*7! m < n, by hypothesis p ya? ™ 4-5 ^", so p | a? ^ — 
b" "rm. Repeating this argument, p divides om — b", where m is 
odd. Thus a” = bm (mod p) and also a” = —b" (mod p). Then 
om" = pD"" (mod p), om = —b"" (mod p) hence p | b (because 
p # 2), so p | a, contrary to the hypothesis. Thus p is a primitive 
factor of a?” — bin, 


The first result is very easy to prove! 


(3F) Let a,b be nonzero relatively prime integers, let n > 1, and 
let p be a prime. The following statements are equivalent: 

(1) p is a primitive factor of a" — b^. 

(2) p| a^ — b^, but ifl m «n, m |n, then p fa — bm, 

(3) p| 9,(a, b), but if 1 € m « n, then p Ÿ PB, (a,b). 

(4) p| $,(a,0), but if 1 € m «mn, m |n, then p / &, (a,b). 

(5) p Jb and if b' is such that bb! = 1 (mod p) then the order of 

ab’! modulo p is equal to n. 


PROOF. The implications (1) — (2) and (3) — (4) are trivial. The 
equivalences (1) — (3) and (2) — (4) follow at once from the ex- 
pression a" — b” = J Jan ala, b). 

Noting that (ab’)4 = 1 (mod p) holds if and only if p | a? — 6%, 
then (5) is obviously equivalent to (1) and to (2). 


Let p be a prime not dividing n. Then the congruence X" —1 = 0 
(mod p) does not have double roots, since the derivative nX”~' has 
only the root 0 modulo p. It follows from (2.5) that if a is an integer, 
a #1, and p fa if 9, ;(a) = D (mod p) then 9,,(a) Z 0 (mod p) 
for every m, 1 < m < p — 1, m dividing p — 1. 

From this observation and from (3F) we deduce: If a Z 1 and p 
does not divide a, then a is a primitive root modulo p (i.e., the order 
of a modulo p is p — 1) if and only if , ;(a) = (mod p). 

The following proposition appears in the paper of Birkhoff and 
Vandiver (1904) and once more in Inkeri's paper (1946) for the case 
where n is an odd prime. The inclusion E, C E»; was first shown by 
Legendre (1830). 
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(3G)  Letm 22. With the preceding hypothesis, the following sets 
of primes coincide: 
E: the set of primitive factors of a” — b”; 
E»: the set of primes p such that p = 1 (mod n) and p | $, (a,b); 
and 
Ea: the set of primes p such that p Yn and p | 9, (a,b). 


PROOF. We show that E, C E». By hypothesis, p | a” — b" but 
p f a" — b" for all m, 1 € m « n. By (3F), if bb = 1 (mod p) 
then the order of ab! modulo p is n. Hence n | p — 1. Moreover, if 
1] € m «n, m | n, then ®,,(a,b) divides a” — b”; since p Ya” — b" 
then p Y $,,(a, b). But p divides a” — b” = Il, $,,(a, 0), hence p 
divides 9, (a, b). 

Obviously E» C E3, because if n | p — 1 then n < p, hence p Jn. 

Now we show that E3 C Æ. Clearly p | a” — b”. Suppose p is not 
a primitive factor of a" — b". Then there exists m, 1 € m « n, m |m, 
such that p | a" — 0". From 


a” — b” = o, (a, b)(a"" — p") II $ (a, b), 
dm, din, dý m 


by hypothesis p divides (a” — b”) /(a/" — 5"). We write n = md, a” = 
ou, 6” = bı, hence 


min 


a" — b^ ad > bd d—1 Bä 
ENEE 


i=0 


But p | a hu, so p Fox, hence p | d, so p | n, which is a contradiction, 
concluding the proof. 


Next we prove: 


(3H) Let a,b be nomzero distinct relatively prime integers, and 
nz 1. Let p be a primitive factor of a^ — b, let v,(a4 — b?) =r >1 
and assume that r > 2 if p — 2. Then 

(1) vy (9a(a, b)) = r; 

(2) ift > 1 then v, (64,:(a, b)) = 1; and 

(3) ift>0, k >1, pyk then v,(O,4,:(a, DI) = 0. 


PROOF. (1) By (3F), p /$;(a, b) for all l, 1 < l < d. It follows that 
Up(Pa(a, b)) = vy (a* — b°). 
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(2) We have, by (2.12), 


t—1 t—1 


a? — y" = Pap (a,b) (a — b” 


II Pept (a, b). 


eld, 1&e«d 


We note that p Y ®ept (a,b) when e < d, otherwise p | a^? — rr ` since 
p is a primitive factor of a^ — b? then by (3F)(5), d divides ep‘; but 
p /d so d | e, a contradiction. But by (3B)(6) and (3B) (7), 


uy (a* >. pt) — r4 t, 


vy (at ` — pe) i pb LL 
Therefore v, (Par (a, b)) = 1. 
(3) We have 
atte’ pr — i (a, b) II ®,(a,b) - Lei = pu) 


e|dkp*, e dpt, e<dkpt 


By (3B)(6) and (3B)(7), v,(a*»' — bi) =r +t, v, (ad? — pt») = 
r+ t, hence v,(®axpt(a, b)) = 0. 


For every integer n > 2, let P[n] denote the largest prime factor 
of n. 


(31) Leta>b>1, let gcd(a, b) = 1, and let n > 2. Let p be a 
primitive factor of af — b/ such that p | ®,(a,b). Then: 

(1) There exists j > 0 such that n = fp! with p Yf. 

(2) If j » 0, then p = P[n]. 

(3) Ifj > 0 and p? | 9, (a, 0), then n = p = 2. 

(4) ged(®, (a, b),n) = 1 or Pin]. 


PROOF. (1) By (2.11), 9,(a,b) divides a" — b"; then p | a” — b”, 
hence f | n by (3F). Since p | a^^! — b-t}, again f | p— 1, so f < p. 
Let n = fp!w with j > 0, p y fw. Write r = fp’. By (3.2), 
a” — b” 
a” — br 


= wb"! (mod a" — b"). 


Since p | a^ — b” (because f | r), then 
a” — b” 


oR = wb”! (mod p). 
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If n < m then by (2.12), 
py b, (because gcd(a, b) 
n= fp. 

(2) From f < p, if j > 0, then p = PI[n]. 

(3) Let j > 1 and s = fp/^!, so n = ps. Then 


$, (a,b) divides (a^ — 0")/(a" — b"). Since 
= 1), then p | w, which is absurd. So 


om — bm 8 [(a® — bei + bs]? — bet 
as — bs = as — bs 


= pb°@—1) + B EN uf b*)ps(»-2 + H EN = bP) 77-9) 


Tec (af — bel, 


If p > 3, since p | a? — be, then 


a^ — b” 
as — bs 


On the other hand, by (2.12), 9, (a, 5) divides (a” — b")/(a* — b°), 
hence p? | $, (a,b). Thus, if p? | 9, (a, b), then necessarily p = 2. So 
f <p—1implies f = 1 and n = 27, so $6, (a,b) #0 (mod r), which 
is absurd. This shows that 7 = 1 and n = 2. 

(4) Assume that there exists a prime p dividing gcd(®,,(a, b), n). 
By (1) and (2), p = Pin]. By (3), if p | gcd(9, (a,b), n), then 
n = p = 2, so p? Yn. This shows the assertion. 


=p (mod p°). 


The following very interesting theorem was proved by Bang (1886) 
in a particular case. In 1892, Zsigmondy proved the stronger ver- 
sion presented here. It was rediscovered by Birkhoff and Vandiver 
(1904) and by various other mathematicians, like Dickson (1905), 
Carmichael (1913), Kanold (1950), Artin (1955), Hering (1974), Lü- 
neburg (1981) and maybe others. 


(3J) Leta>b>1, gced(a,b)=1,n>1. 
(1) a” — b" has a primitive factor, with the following exceptions: 
(a) n=1,a—-b=1; 
(b) n= 2, a+b a power of 2; and 
(e) n=6,a=2,b=1. 
(2) a" +b” has a primitive factor, with the following exception: 
n=3,a=2,b=1. 
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PROOF. (1) It is clear that in cases (a), (b), (c), a” — b” does not 
have a primitive factor. If n = 1 and a — b does not have a primitive 
factor, then a — b — 1. 

Let n = 2 and assume that a? — b? does not have a primitive factor. 
From a? — b? = (a+ b)(a — b) and gcd(a + b, a — b) = 1 or 2, if p 
is an odd prime dividing a + b, then p divides a? — b?. But p is not 
a primitive factor, so p | a — b, hence p divides a and b, which is 
absurd. This shows that a 4- b is a power of 2. 

Now let n > 3 and assume again that a" — b" does not have a 
primitive factor. Let p = P[n] and v,($, (a, b)) = j > 0. Define 


®,,(a, 
$9; (a,5) = POP). 
p 
(1°) Assume that ®*(a,b) = 1. Let G1, C2, ... Get be the primi- 
tive nth roots of 1. From 
a a 
— Gio! = |y — Gi lh) esae 
la — CN 5 E SE ) a—b 
and a previous remark, 
(n) 


9, (a,b) = |®,(a,6)| = [[ la — Gb] > (a — 0)*? > 1 = 9; (a, 0). 
i=1 
So j > 1 and p | ®,,(a,b), hence p divides a” — b^; so p is a primitive 
factor of af — bf, where f divides n. By (31), gcd(n, ®(a,b)) = p and 
also p? Y 9, (a, b). 

In conclusion, o, (a,b) = p, because ®*(a,b) = 1. Moreover, from 
p | n, it follows that p — 1 divides y(n). This implies in turn that 
p = 9, (a,b) > (a — b)* > (a — b)?-!, hence a — b = 1. 

If p? | n let n = pm, then p — 1 € (m) and by (2.14) 

p= 9, (a,b) = m (aP — b?) > (a? — pr) e) > (a? — bt, 
because p | m. Thus a? — b? = 1, which is not compatible with 
a—b=1. 

Thus, from (3I), n = pf, p Y f, where p is a primitive factor of 
af — bf. Note also that f | p— 1, so f < p. From y(n) = (p— l)e(f) 
it follows that 
p(a?—b?) > p(a! —b^) > ®,,(a,b)®5 (a,b) = ,(a"—b^) > (aP—pP)*(, 


using (2.12). Therefore p > (a? — b?)?f)-!, hence necessarily (f) = 
1, thus f = 1 or f = 2, so n = p or n= 2p. 
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If n = p, then p = ©, (a, b) = a?^! + a? -7b+---+ ab? 2+ b-t = 
(a? — b?)/(a — b) = a? — b”, and this is absurd because a — b = 1. If 
n = 2p, from 3 < p it follows from (2.12) that 


a? + b? 
a+b 


p= $», (a, b) = 


By (3D), necessarily a = 2, b= 1, and p= 3, so n = 6. 

(2°) Assume that a” — b” does not have a primitive factor. If 
suffices to show that ®*(a,b) = 1 and the result follows from (1°). 
Let p be a prime dividing ®,,(a,b), so p | a” — b”. Then there exists 
f, dividing n, 1 € f < n, such that p is a primitive factor of af — bf. 
By (31), p = P[n] and 9,(a,b) = p? with j > 1. Hence 97 (a,b) = 1. 

(2 Ifn=3,a=2, b = 1, then a^ +b” = 2? +1 has no primitive 
factor. Conversely, if n = 1 and a+ b > 2, so there is a primitive 
factor. 

If n = 2 and a?+0? does not have a primitive factor, then a? 4-0? = 
2* (with k > 2). Indeed, if p is an odd prime dividing a? + b?, then 
p | a+b, so p | a? — b?, hence p | 2a?; it follows that p | a and also 
p | b, which is absurd. From a? + b? = 2* (k > 2), ged(a, 6) = 1, 
it follows that a,b are odd, hence a? + b? = 2 (mod 4), which is a 
contradiction, proving that a? + b? has a primitive factor. 

If n > 3, it follows from (1) that a?" — D?" has a primitive factor 
p with the only exception n = 3, a= 2, b = 1. If p= 2 then a,b are 
odd, so 2 | a+ b, which is not compatible with 2 being a primitive 
factor of a” — b”. 

By (3E), a” + b” has a primitive factor, with the exception indi- 
cated. 


It follows from this theorem and (3F) that if a > 2, then each 
number in the sequence 


$3(a), $,(a), $5(a), Pe(a), (a), SE 


(with s(a) deleted when a = 2) has a prime factor which is not a 
factor of any of the preceding numbers. 
The following results are also of interest: 


(3K) Let 1 € m « n, and a > b > 1, with gcd(a,b) = 1. If 
gcd(9,, (a, b), 
$, (a, b)) Z 1, then P[n] = gcd(9,, (a, b), ®,,(a, b)). 
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PROOF. If n = 2, then m = 1. If gcd(a — b, a +b) Æ 1, then 
gcd(a — b, a+b) = 2. 

Now assume n > 3. Let p be a prime and let e > 1 be such 
that p° | Pm(a, b), p° | ®,(a,b). Thenpl|a"—b",pl|la^—b^, 
so p is not a primitive factor of a^ — b^. By (3G) p | n and 
by (31), p = Dal, $,(a,b) = pc, p Y c, soe = 1. Since p was 
an arbitrary common divisor of ®,,(a,b), n (a,b), this proves that 
Pn] = gcd(9,, (a, b), $, (a, b)). 


(3L) Let p be any prime, let 0 € i < j, and let a > b > 
1, gcd(a, b) = 1. Then 


ged(y (a,b), 9,00) = À, fola 
PROOF. By (3K), if d 2 gcd(®,:(a, b), ®,;(a,b)) Z 1 then d = p. 
Assume first that p Z 2. If p | a — b, then a” =a=b=b ` 
(mod p) so by (3B), p divides ©, (a, b) = (a”’ — b”’)/(a””* — bP”). 
Similarly, p divides ®,:(a,b). Finally, if p / a — b, then ur Emu 
b = b” (mod p), so p Ya” — b” and a fortiori p Ÿ $,;(a, b). Thus, 
gcd(®,: (a, b), $,;(a, b)) = 1. 
If p = 2, then ®,(a,b) = a — b and ®(a,b) = a? +b? (for 
k > 1). So if a = b (mod 2), then 2 divides gcd(®z: (a, b), 5; (a, b)), 
and conversely. 


The following corollary of (3L) will be useful: 


(3M) Ifa>b>1 are integers and n > 2 then Pla" — b"] >n 
and Pla” + b"] > 2n. 


PROOF. We may assume without loss of generality that gcd(a, hi 
= 1. Indeed, let d = gcd(a,b) and let a = dou, b = dbi, so a, > 
bı > 1 and gcd(a,,5,) = 1. Moreover, a” + b” = d” (a? + bt), hence 
Pla? + b}] € Pla” + b"|; so it suffices to show that n < Pla} — bi] 
and 2n < Pla? + bi]. Thus, we assume gcd(a, b) = 1. 

(1) Ifa=2,b= 1, n = 6 then a” —b” = 929—163 = 3° x 7 and 
P[29 — 1] = 7 > 6. In the other cases, by (3L) let p be a primitive 
factor of a” — b”. By (3G), p = 1 (mod n) so p = 1 + kn, hence 
Pla” — ol pnm. 


(2) 
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By (3L), let p be a primitive factor of a?" — bin. By (3G), 


p = 1 (mod 2n), so p = 1 + 2kn > 2n. By a previous remark p is 


also a primitive factor of a" + b". Hence Pla” + b"| > p > 2n. 


1729 


1738 


1769 


1830 


1837 


1886 


1888 


1892 


1897 


1904 


1905 
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IL4. The Resultant and Discriminant of Polynomials 


Let 


F(X,Y) = aX” +a X" Y 4---+a,Y" (with ao £ 0), 
G(X,Y) = byX" +b X” A baim (with bo #0), 


where the coefficients a;, b; belong to an integral domain A. 

We shall define the resultant of F, G, denoted by R(F,G) or also 
by Res(F, G). First, if m = 0 we define R(F, bo) = bj, while if n = 0 
we define R(ao, G) = aj’. In particular, R(ao, bo) = 1. Next, ifm Æ 0 
and n Æ 0 we define R(f,g) to be the determinant of the following 
matrix with m + n rows and columns: 


On Q1 SS eee An 0 0 

0 On ay m m An 0 

0 0 eee ao Q1 m An . 

U^ 205 $45. bo bo bml’ 

O bo bi o sexe 0 

libi Sec des gro At 00 
note that there are m rows containing a6,... , a, as entries, followed 
by n rows with bo,... , Om. 


R(F,G) is called the resultant of F(X,Y) and G(X,Y). R(F,G) 
is a polynomial with coefficients in Z of degree m in the coefficients 
a; and of degree n in the coefficients b;. 

The resultant of OF(X,Y)/0X, OF(X,Y)/OY is called the dis- 
criminant of F(X,Y): 

e OF OF 
Discr(F) —R Ei 5r) N 

We recall the following well-known properties (see Bôcher (1907) 

or Cohn (1974)): 


(4A) Let F(X,Y), G(X,Y) be binary forms of degrees, respec- 
tively, n 2 1,m > 1. Then: 


(1) F(X,Y) has a (nonconstant) factor proportional to a factor 
of G(X, Y) if and only if R(F,G) = 0. 
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(2) If n 2 2 then F(X,Y) has a multiple linear factor if and only 
if Discr(F) — 0. 

(3) IF F(X, Y) = Ita (aX -aY) and G(X, Y) = IT. (8/X — 
BY) (with a, ai’, Bi, Bj" elements of a field containing the 
coefficients of F, G and a; # 0 for each i and B; # 0 for 
each j) then 


TL 


R(F.G) = [LG œ) = (-1)"" TI F(8,,8/) 


= (Sa — pja) = (-1)"^ | [(4 8; = da 


In particular, R(G, F) = (-1)"" R(F, G). 

(4) If H(X,Y) is also a binary form of degree l, then R(FG,H) 
— R(F, H)R(G, H) and R(H, FG) = R(H, F)R(H,G). 

(5) If n 2 m and H(X,Y) is a form of degree deg(H) = n — m, 
then R(F — HG,G) — R(F,G). Similarly, if m 2 n and 
K(X,Y) is a form of degree deg(K) = m — n then R(F,G — 
KF) = R(F,G). 


Now let f(X),g(X) be any nonzero polynomials of degrees n, m, 
respectively. Let F(X,Y) = Y” f (X/Y), G(X,Y) =Y™g(X/Y) so 
F (X,Y), G(X,Y) are binary forms of degrees n, m, respectively. 

The resultant of f,g is by definition R(f,g) — R(F,G). The dis- 
criminant of f is, by definition: Discr(f) — R(f, f"). 

If n 21,m > land F(X,Y) = TI, (o4 X — o;Y), G(X, Y) = 
II. (8; X — 8;Y) (as in (1A)), with od 4 0, 8;' 0, then a;/a,' 
are the roots of f(X), 8;/0; are the roots of g( X). 

If f(X) = aX" c aX”! +--+ an (with n > 1, ao z 0), let 
F(X,Y) = Y" f (X/Y) = IIa (a/X — aY). 

The discriminants of the polynomial f (X) and of the binary form 
F(X,Y) are related as follows: 


(4.1) Discr(F) — — Discr(f). 


Indeed, the derivative of f(X) is 
FX) = nayX"^! -(n—1)a X"? + + ani; 


the corresponding binary form is G(X,Y) = Y" !f'(X/Y). On 
the other hand, OF/OX = Y" !f'(X/Y) = G(X,Y), OF/0Y = 
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nY" 1 f(X/Y) — XY"?f (X/Y) = (1/Y) [nF (X,Y) — XG(X,Y)]. 
So, we have, on the one hand 


R (5x Yr) =R EN R (5x ov) = nag Discr(F). 


OX’ OY OX OX’ OY 
On the other hand, 
OF _OF 
R (YS) = R(G,nF — XG) = R(G,nF) 


= R(G,n) - R(G, F) = n™1(-1)"° YR(F,G) 
= n" 1(-1)e-9 R(f, f^) = n"! Discr(f). 


For the convenience of the reader we write explicitly the prop- 
erties of the resultant and the discriminant for polynomials in one 
indeterminate. 


(4B) For polynomials f,g,h,k, with deg(f) =n, deg(g) = m: 
(2) If n € m and deg(h) € m — n then R(f,g) = R(f, g + fh). 
(3) R(hk, g) = R(h, g) - R(k, 9), 

Rig, hk) = R(g, h) - R(g, k). 

(4) R(f* Ek [R(f,g)]? for every integer s > 1. 

(5) R(X — a), g) = [g(a)? where a € A, s > 1. 

(8) If f = ax IIa (X — o4) and  — bo IT. (X — &) then 


Ria = ato JI Tœ- 2 
= of Dain 
= (yop Tro. 


(7) If f = ao IDA CX — o5), then 
Discr(f) — (— 1) n(n— 1/28 SE ‘TI SE a;)? | 


(8) If f = hk, deg(h) = r, and deg(k) = s, then 
Discr(f) = (—1)"*Discr(h)Discr(k)[R(h, k)]?. 
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(4C) 
(1) If f,g € A[X] are nonconstant and R(f,g) #0, then f,g are 
relatively prime. 
(2) If A= K is a field, and if f, g € K|X] are relatively prime, 
then R(f,g) #0. 
PROOF. (1) Assume that f,g have a common nonconstant factor 
h € A[X]. So f = hf; and g = bo, By (4B)(3), 
RC; g) = R(h, h) ` R(h, 91) li T h) I R(f1, 91) = 0. 


(2) Assume that f,g € K|X] are relatively prime. By Bézout’s 
theorem, there exist fi,g: € K[X] such that gif + fig = 1; in 
particular, deg(gı f) = deg(fig). By (4B)(3), 


R(gı f, fig) = Rio, fi)  R(gi g) RCF, fi) + RCF, 9). 
If R(f, g) = 0, then Rio f, fig) = 0. However, by (4B)(2), 
R(gif, fig) = R(gif, pe gf) = Rio f, 1) = 1, 


which is a contradiction. 


Bibliography 


1907 Bôcher, M., Introduction to Higher Algebra, Macmillan, New 
York, 1907; reprinted in 1947. 
1974 Cohn, P.M., Algebra, Vol. I, Wiley, New York, 1974. 


III 
Algebraic Restrictions 


on Hypothetical Solutions 


Assume that n > 3 and x,y,z are nonzero pairwise relatively prime 
integers such that 


x” + y" — z”. 
In this chapter we derive algebraic relations which must be satisfied 
by x,y,z,n. In some cases, these lead to a contradiction showing 


that for the exponent in question, Fermat’s equation has only the 
trivial solution. 


IIL.1. The Relations of Barlow 


Let p be an odd prime and suppose that there exist nonzero pairwise 
relatively prime integers x,y,z such that x? + yP + z? = 0. To 
begin, we observe that «+ y +z z 0. Indeed, x,y,z cannot be 
all positive (nor all negative), so we assume, for example, that x > 
0, y > 0 and z < 0. Then (x+y)? > £? +y’ = —z? since x+y > —2, 
thus x + y +z #0. 

We shall indicate relations which the integers x, y, z must satisfy. 

The first such results were proved by Barlow (1810, 1811) and 
discovered independently by Abel in 1823, who stated them with- 
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out proof, in a letter to Holmboe. The results below were given 
with complete proofs by Legendre, as early as 1823, and were known 
to Sophie Germain. Later, they were rediscovered by Lindemann 
(1901, 1907) and appeared in papers by Catalan (1886), Tafelmacher 
(1892), Fleck (1909), Lind (1910), Bachmann (1919), James (1938), 
Raclis (1944), etc. 


(LA) [If there exist nonzero integers x,y,z such that x? + y? + 2? = 
0, gcd(z, y, z) = 1 and p does not divide z, then there exist relatively 
prime integers t,t,, not multiples of p, such that 


xP + yP 


=? z = —tt4. 
r4y 1» 1 


Luz H, 
Moreover, t, is odd and t4 > 1. 


PROOF. From the hypothesis, x,y,z are pairwise relatively prime. 
Consider the integer 


(1.1) 
Q(z, —y) TA eae = 


ar! "m y? y ie ry? St y^. 
r-cy 


Since z +y +z = x? + yP +z? =0 (mod p) and p/z then p Yz +y. 
By Chapter II, (3B), gcd(x + y, Q,(x, —y)) = 1. 

From (—z)? = x? + y? = («+ y)Q,(x, —y) we conclude that x + 
y. Q,(x, —y) are pth powers, i.e., there exist integers t,t, such that 
£ +y = t, Q (x, —y) = t$, so —z = tt and ged(t, ti) = 1. 

We show that tı is odd. From (1.1) we see that Q,(x, —y) is the 
sum of an odd number of terms; among these terms, x?~' or ul is 
odd (because x, y are not both even). Thus Q,(x, —y) must be odd, 
hence tı is also odd. Finally, from (1E) since x > y (or y > x) then 
z—y21(ory- æ > 1), hence & = Q,(z, —y) = Q,(y,—2) > p, so 
tı > 0 and in fact t, > 1. 


If x,y,z satisfy x? + y? + z? = 0, if p does not divide x,y, z and 
if gcd(x, y, zl = 1 then by the previous result there exist integers 
r,s, t, rı, 81, D, not multiples of p, such that 


aty=t, (æ +y) /(z+y)=t, z 
(1.2) y+z=, EN “Te 2”) /(y SL z) = ris T— TT, 
tas, Latar al d, y 
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Moreover, r,s,t,71,51,t; are pairwise relatively prime, Tu, gn, Du are 
odd and greater than 1. We note that r? + 5? +t? = 2(a+y+z) z 0. 
By addition and subtraction, it follows that 


y = —sP + (rP + s? + tP)/2 = (r? — s? + tP)/2, 


£ = —r? + (r? + s? + 09)/2 = (—r? + s? + 19)/2, 
(1.3) 
z = DD + (r? + s? + tP)/2 = (r? + s? GU. 


We have the following complement of (1A) known to Sophie Ger- 
main and reproduced by Legendre (1823). Proofs were also given 
by Fleck (1909), Lind (1910), Pomey (1923) and again by Spunar 
(1928), James (1938), Pérez-Cacho (1958), and Draeger (1959) (in a 
different form). 


(1B) [fp is an odd prime not dividing z, every prime divisor q of 
t, is congruent to 1 modulo 2p. In particular, tı = 1 (mod 2p). Jf, 
moreover, p does not divide xyz, then every prime divisor of risit: is 
congruent to 1 modulo 2p°. In particular, rı = 1 (mod 2p°), s, = 1 
(mod 2p?) and t, = 1 (mod 2p°). 


PROOF. Let q be a prime dividing tı. Then q divides x? + y? but 
q does not divide x + y = t? because gcd(t, t1) = 1. By Chapter II, 
(3G), q= 1 (mod p). Since q — 1 is even then q = 1 (mod 2p). 

Now we suppose that p / xyz and that q is a prime dividing ri, 
hence q divides x. Therefore q does not divide yz. We note also that 
gcd(r, r1) = 1, so q does not divide y + z. 

Thus we have y = t? (mod q), z = s? (mod q), hence t? + 5s? = 
y+z #0 (mod q) and t? 4 SP = yP + 2? = x =0 (mod q). So q 
is a primitive factor of t? +s”. By Chapter IL (3E), q is a primitive 
factor of #2P° — sët. hence by Chapter II, (3G), q = 1 (mod 2p?). In 
particular, rı = 1 (mod 2p’). 

The proof is similar for the prime factors of sı and tı. 


Now we give the relations which must be satisfied by would-be 
solutions in the second case; these facts (including n > 2) were 
known to Sophie Germain and were given by Legendre (1823). 


(1C) Let x,y,z be nonzero integers such that p divides z, x? + 
y? + z? = 0 and gcd(x,y,z) = 1. Then there exist an integer n > 2 
and pairwise relatively prime integers rs tr, 81,011, not multiples 
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of p, such that rı,sı,tı are odd and greater than 1 and satisfy the 
relations 


(1.4) 
vy p", (x? + y?)/(a@+y) = pth, z= -p"tt, 
ud ee (yP-czP)(ytz)—ri, v= rn, 
z+x= sp, (2? +aP)/(z+x) =s}, y--ss. 


Moreover, if q is any prime dividing t, then q = 1 (mod p?); in 
particular, t = 1 (mod 2p’). 


PROOF. x,y,z are pairwise relatively prime. If p divides z then p 
does not divide x nor y, and from x +y +z = x? +y +2? =0 
(mod p) it follows that x + y = —z = 0 (mod p). 

Let m > 2 be such that x + y = p"-!t', where p does not divide 
t. Let Q,(x, —y) = (x? +y?)/(a+y). Since p divides u = x+y then 
by Chapter II, (3B)(6), v, (Q,(x, —y)) = v,(p) = 1. So Q,(z, —y) = 
pti, pf ti'. By Chapter I, (3B)(4), gcd(x + y, Q,(x, —y)) = p, so 
gcd(t’, ty’) = 1. 

Since —z? = x? + y? = p"t't, then by unique factorization p 
divides m, and t', t, ' are pth powers of integers. We may write 
m = pn, 


xP + y? 
= pti, 
T +7 
z = —p'tti, 


where n > 1, t,t, € Z, gcd(t, t1) = 1, p does not divide ¢ nor t. 
Since p Yx and p Jy, by (1A) there exist integers r, r1, s, sı, not 
multiples of p, such that 


y+z=r, (yP +2)/(y+2)=r?, x—-rn, 


z+r= 2P, (2? + xP)/(z +x) = si, y = —881, 


and gcd(t, r, s) = gcd(t1, r1, s1) = 1, gcd(r, r1) = ged(s, al = 1. The 
proofs that rı, sı,tı are odd and that rı > 1, sı > 1 are the same as 
in (1A). 

Now we show that t; > 1. By Chapter II, (3D), pt? = (x? + 
yP)/(x + y) = Q»(z,—y) = Qp(y,-x) > p, hence tı > 1 and it 
suffices to show that tı Z 1. If tı = 1 then again by the same result, 


II.1. The Relations of Barlow 103 


assuming for example x > y, this would imply: p = 3, x = 2, y= 1, 
hence 2? + 1? + 2? = 0 which is impossible. 

Now we show that if q is any prime factor of tı then q = 1 
(mod p?). We have z = 0 (mod q) hence y = r? (mod q), x = s? 
(mod q) and 0 = —a? + y? + z? =r?’ + s” (mod q). On the other 
hand, q does not divide r? + s? (otherwise q divides x + y hence q 
divides t, contrary to the fact that gcd(t,t,) = 1). It follows from 
Chapter IL, (3D), that q = 1 (mod p?). This implies that tı = 1 
(mod 2p°). 

It remains to show that n > 2. In fact, rP? + s? = 2z + (x + y) = 
—2p"tt, + p?"-1t? = 0 (mod p). By Chapter II, (3H), r? + s» = 0 
(mod p?). Since p | tt, it follows that 2z = (r? + s?) — p?" t» = 0 
(mod p°), so n > 2. 


From (1.1) we deduce that r? + s? + pp = 2(x + y +z) z 0. 

The fact that the exact power of p dividing z +y is p^! has been 
proved again and again (even in 1955 by Stone) by authors unaware 
that this result has been known for a long time. 

We write the relations analogous to (1.3), assuming that p | z: 


(1.5) 
x= rP + (rP + s? + pert tP)/2 = (—r? + s? + perl tP) /2, 
y = —s? + (r? + s" + pP" t9)/2 = (rP — s? + pr it?) /2, 
z = —pP"- HP? + (rP + s? + pr lt?) /2 = (rP + s? — pP^-119)/2. 


In the case of a squarefree exponent it is still possible to indicate 
some relations which are reminiscent of the Barlow relations; see 
Stewart (1977): 


(1D) Ifn > 2 is a square-free integer, if x,y,z are nonzero pairwise 
relatively prime integers such that x” + y" = 2" (respectively, if n 
is odd and x" — y" = z”), then z — y = 2"d"-!a" (respectively, 
2 + y = 2"“d" la") where a, d are natural numbers, u is equal to 0 or 
1, and 2" and d divide n. 


PROOF. We first consider the case where x” + y^ = z” and we write 
z — y = a'a” where a,a’ > 1 and for every prime p the p-adic value 
of a’ is Up(a’) < n. 

If p is a prime dividing a’ then p divides n. Otherwise, p / n 
and by Chapter II, (3B)(5), p / Qn(z,y) = (z" — y")/(s — y). So 
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(a) + nv, (a) = vp(z — y) = Vp(2” — y^) = v(x”) = nu,(y). Hence 
n divides v,(a’) and n < v,(a’), which contradicts the hypothesis. 
This shows that p | n, so v,(n) = 1, because n is square free. 

Now we determine v,(a’) when p divides a’. First, let p be an 
odd prime. By Chapter II, (3B)(6), v, (Qn(z,y)) = v,(n) = 1. So 
v,(a’)+nv,(a)+1 = v,(z—y)--vo(Qu(z, y)) = vo (2^ y") = v(x") = 
nv, (x); hence v, (a^) = —1 (mod n) and therefore v, (a^) = n — 1. 

Now let p = 2. If 4| z — y, by the result already quoted, Part 
(7), we have vo(Qn(z, y)) = ve(n) = 1, so as above we conclude that 
v (a) = n— 1. I£ 2| z — y but 4f z — y then v2(a’) + nv(a) = 
U2(z — y) = 1, so v2(a’) = 1. There remains the possibility that 2 
does not divide z — y. 

Putting these facts together, we may write z — y = 2"d"-!q", 
where u = 0 or 1, 2" divides n, and d”~' divides n. Now let n be 
odd and z^ — y" = z^. Then x” + (—y)" = z” and by the first part 
of the proof z + y has the form indicated. 


In particular, if n is a square-free integer, n > 2, if x,y,z are 
nonzero pairwise relatively prime integers such that xz" + y” = 2”, 
then 

gS e ar, 

z=y= Da, 
and, moreover, if n is odd then 

x+y = 2d a 


where a1, a2, ga, di, dz, dg are natural numbers, u1, U2, u3 are equal to 
0 or 1, and 2”, 2%, 28, di, d2, d4 divide n. 
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III.2. Secondary Relations for Hypothetical Solutions 


In $1 we have seen that if xz, y, z are nonzero pairwise relatively prime 
integers, p an odd prime and if x? + y? + z? = 0 then the Bar- 
low relations must be satisfied; in particular, there exist integers 
rT,S,0,71, 51,1, satisfying certain properties. In this section, we give 
further properties which must be satisfied by these integers. 

If m,n are nonzero integers, gcd(m,n) = 1 and n is odd, let (m/n) 
denote the Jacobi symbol. 

The following consequence of Barlow’s relations was first indicated 
by Pierre in 1943. 


(2A) Let x,y,z be nonzero relatively prime integers such that x? + 
y? + z? — 0. 
(1) Ifplxyz then 


t 
CE M n 
8111 try T1$1 


(2) Ifp|z then 
(=) eu. 
T1581 


ar) tee 
pst, Dir 


Pnoor. (1) To begin, we note that the polynomial pXY (X + Y) 
divides (X + Y)? — (X? + Y?) (in Z[X, Y]) (see also Section VII.2). 
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By (1A) we have (x + y)?! — t? = 0 (mod pry). Then t? = 
(x + y)?! (mod r181), hence 


GHG 

(GEM T1$51 
i-es) 
811i ryt, 


(2) Ifp |z, by (1C) we have (x + y}?! — pt? = 0 (mod pzy). 
Hence pt? = (x + y)?! (mod rus), Therefore, 


p 
(=) = (=) EN 
T181 T181 
As in the proof of (1), (y + z)" ^! — r? =0 (mod pry). Therefore 
r? z(y-cz)" (mod psiti), 


ien) ae) 
Det Det: 
(5) = +1. 
prit 
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IV 


Germain’s Theorem 


In this chapter we give the beautiful theorem of Sophie Germain for 
the first case of Fermat’s theorem. 


IV.1. Sophie Germain’s Theorem 


Sophie Germain, a French mathematician, contemporary of Cau- 
chy and Legendre, proved a very beautiful theorem of an entirely 
new kind which established “d’un trait de plume” (in Legendre’s 
expression) the first case of Fermat’s theorem for every prime p < 
100. Her method is still being explored by other mathematicians. 
Legendre developed S. Germain’s ideas in his paper of 1823. 

We begin with an easy observation; (3) was also given by Bang 
(1935). 


(LA) Let q be a prime and n > 3 be an odd integer. The following 
statements are equivalent: 


(1) There exist integers a,b,c, not multiples of q, such that a" + 
b” Lon =0 (mod q). 


110 IV. Germain’s Theorem 


(2) There exist integers d, e, not multiples of q, such that d" = 
e" +1 (mod q). 
Moreover, if q — 1 = 2kn, the above statements are equivalent to: 


(3) There exist roots u, u' of the congruence X?* —1 = 0 (mod q) 
such that u Z u-- 1 (mod q). 


PROOF. (1) — (2) Since q Ke there exist integers d, e € Z such that 


dc = —a (mod q), 
ec = b (mod q). 


Then q fde, (dc)" = (ec)" +c” (mod q), so d" = e" +1 (mod q). 

(2) (1) This is trivial. 

Now we assume that q — 1 = 2kn. 

(2) — (3) Let u = e", u’ = d", then u* = et^! = 1 (mod q) and 
similarly (u’)?* = dt = 1 (mod q), with w = u + 1 (mod q). 

(3) — (2) Let h be a primitive root modulo q. Let u = hk”, 
so bim = 4?* = 1 (mod q), hence q — 1 = 2kn divides 2km, so n 
divides m. Thus u = e" (mod q). Similarly u’ = d" (mod q) and 
d" = e^ +1 (mod q). 


Now we give Legendre's version of Sophie Germain's theorem:! 


(1B) Let p,q be distinct odd primes and assume that the following 
conditions are satisfied: 


(1) If a,b, c are integers such that a? + b? + œ = 0 (mod q) then 
q | abc. 
(2) p is not congruent modulo q to the pth power of an integer. 


Then the first case of Fermat’s theorem is true for the exponent p. 


PROOF. Let x,y,z be pairwise relatively prime integers, not multi- 
ples of p, such that x? + yP + z? = 0. Then x? + y? + z? =0 (mod q) 


1See footnote, p. 13 of Legendre's paper of 1823, where he wrote: “This 
proof which, one has to agree, is very ingenious, is due to Mlle. Sophie Ger- 
main, who cultivates with success the physical and mathematical sciences, 
as witnesses the prize she has been awarded by the Academy for her paper 
on vibrations of elastic blades. She is also the author of the proposition 
in art. 13 as well the one which concerns the particular form of the prime 
divisors of o, given in art. 11." [Here, these correspond to propositions 


((2B), (2C)).] 
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and by hypothesis (1), q | xyz. We may assume, for example, that 
q | z, hence q / yz. 

Since p Y xyz there exist integers r, s, t, r1, 81, tı satisfying the rela- 
tions (1.2) and (1.3) of Chapter III. Since q | x then =r? +s” -t? = 0 
(mod q). By hypothesis (1), q divides one of the integers r, s, t. Since 
s divides y, t divides z, and q does not divide yz and q does not di- 
vide st, so q divides r. But t? = (x? + y?)/(a + y) = y"! (mod q), 
because q | x. 


Since q divides r, so y = —z (mod q). Hence 
p + EN 
EE ey (2) pt Ee 
y +2 


= py" = pt? (mod q). 


Since tı Z 0 (mod q) there exists an integer t’ such that t/t; = 1 
(mod q), hence p = (Ur IP (mod q), which contradicts the second 
assumption. 


Before proceeding, we comment on the above conditions. 

In the next section, we shall introduce the Wendt determinant, 
which will serve to test the existence of integers x, y, z, not multiples 
of q, such that x? + y? + z? =0 (mod q). 


(1C) Ifpandq are odd primes and q—1 = 2pk, k a natural number, 
then condition (2) of (1B) is equivalent to each of the following: 


(2^) (2k)?* Z 1 (mod q); and 
(2") p?* #1 (mod q). 


PROOF. We show first that (2) implies (2’). Let h be a primitive 
root modulo q and let p = h° (mod q). If (2x)? = 1 (mod q) then 
h?*s = p™ = (2k)™"p™ = (2kp) = (q — 1)?* = 1 (mod q); hence 
q — 1 = 2kp divides 2ks so p | s and p = a? (mod q) with a = bin 
(mod q). 

Now we show that (2^ implies (2”). If p?* = 1 (mod q) then 
(2k)?* = (2k)?*p?* = (q — 1) À =1 (mod p). 

Finally, we prove that (2") implies (2). If there exists an a such 
that p =a? (mod q) then p?* = a? = at! =1 (mod q). 


In Table 6 (see Legendre, 1823), we indicate, for each p « 100, 
the choice of q, of a primitive root h modulo q and the set R of 
residues of pth powers, modulo q. The computations, which are 
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quite lengthy, are done using the primitive root modulo q. They 
establish condition (2). 

For larger values of p the computations become forbidding. How- 
ever, a glance at the table reveals that in each case, q has been chosen 
to equal 2p 4- 1, or 4p - 1, or 8p+ 1, or 10p 4- 1, or 14p+ 1, or 16p+ 1. 
Indeed, the following corollaries of Sophie Germain's theorem hold 
(Legendre, 1823): 


(1D) [fp is an odd prime and q = 2p + 1 is also a prime, then 
the first case of Fermat’s conjecture is true for p. 


PROOF. We show that q satisfies the assumptions of (1B). 

If x,y,z are integers not multiples of q and x? + yP + z? = 0 
(mod q), then from p = (q — 1)/2 we have x? = +1 (mod q), y? = 
+1 (mod gq), z? = +1 (mod q). Hence 0 = z?--y?--z? =+1+141F 
0 (mod q), a contradiction. 

Similarly, if p = a? (mod q) then condition (2') is not satisfied, so 
2p 4- 1 — q divides 2? — 1 — 3, which is absurd. 


Legendre extended this criterion (1823): 


(1E) [fp isa prime, p > 3 andq=4p+1, org 8p+1, or q = 
10p +1, org = 14p +1, orq = 16p + 1 is also a prime, then the 
first case of Fermat’s theorem is true for the exponent p. 


PROOF. We show that in each case q satisfies the assumptions of 
(1B). 


Case 1: Let q = 4p +1. 

If p = a? (mod q) then by condition (2’) above, 44 = 1 (mod q) 
so 4p + 1 = q divides 255 = 3 x 5 x 17, which is absurd. 

For the first condition, let w be a primitive fourth root of 1, mod- 
ulo q. So (1, w, u?,u?) are the roots of X^ — 1 = 0 (mod q), and 
w? = —1 (mod q), w? = —w (mod q). If the first condition is not 
verified, then by (1A) there exist i Z j,0 < à, j < 3, such that 
wu) = w'+1 (mod q). Apart from trivial cases, this leads to one of 
the following possibilities: w = +2 (mod q) or 2w = +1 (mod q). 
Raising to the square it follows that q — 5, which is absurd. 


Case 2: Let q = 8p +1. 
If p = a? (mod q), then proceeding as before, 8p + 1 = q divides 
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TABLE 6 

p q h R 

3 7=2x34+1 3 +1 

9 1122x521 2 rz 

7 | 29=4x7+1 |2 |l, +12 

11| 23=2X11+1 15 | + 

13} 53=4x 1341 |2 1, +23 

17|]13728 x 17 4 1 |3 |+1, £10, +37, £41 

19 | 191 = 10 x 19 1| 19 | E1, £7, £39, £49, £82 

23| 47=2x 2341/5 |+l 

29| 59=2x 294+ 1/2 | + 

31 | 311 = 10 x 31 +1] 17 | £1, £6, £36, £52, £95 

37/149=4 x 37 +1 |2 1, +44 

41) 88=2x414+1/2 | + 

43 | 173 = 4 x 484+ 1 | 2 1, +80 

47|659 = 14 x 47 -1|2 | xl, £12, £55, +144, +249 
270, +307 

53 | 107 =2 x 53 +1 2 Sie 

590|827— 14 x 59-1|2 |+1, +20, +124, +270, +337 
389, +400 

61[977 216 x 61 -1|3 |+l, +52, +80, +227, +252 
357, +403, +439 

67 /269=4x67+1 |2 | +1, +82 

71/569=8x71+1 |3 Æl, £76, £86, +277 

73|203— 4 x 78-1 2 |+1, +138 

79|317—4x 19-1 |2 | £1, +114 

83|167—2x83-1 5 +1 

89 |} 179 =2 x 89+ 1 2 T 

97 389=4x97+1 |2 1, +115 
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88 — 1, so it divides Si — 1 = 4095 = 3? x 5 x 7 x 13 or 84 + 1 = 
4097 = 17 x 241, which is an absurdity. 

If the first condition is not verified, if w is a primitive eighth root 
of 1 modulo q, there exist i Æ 7,0 < i, j € 7, such that u? = wê +1 
(mod q). SC wt = —1 (mod q) we may consider the congruences 
wi = +w’ +1 (mod q), with 0 € à € 3. The primitive eighth roots 
of 1 modulo q being +w, +w?, and w? = —w ! (mod q), we are 
reduced to study the following possibilities: 
+2 (mod q) (j = 1,2); 
w+1 (mod q); and 

(iii) 2w = +1 (mod q). 

We discuss the various cases. 

(i) Raising to the fourth power leads to q | 15 or q = 17, an 
absurdity. 

(ii) Raising to the square, 


w+1+2w+1=3w+2 (mod q), 
w — 1 — 2w + 1 = -—w (mod q). 


-lsutt2w+1={ 


Hence w = +1 (mod q), an absurdity. 
(iii) Raising to the fourth power, q = 17, absurd. 


Case 3: Let q = 16p + 1. 

If p = a? (mod q), then with the same method, 16p+1 = q divides 
1616 — 1 = (165 + 1)(16* + 1) (16? +1) x 17 x 15. Clearly 16p+ 1 does 
not divide 15, 17, 257. If 16p + 1 divides 16* + 1 = 65537, which 
is a prime (in fact a Fermat prime, 2! + 1), then p = 16?, which 
is absurd. If 16p + 1 divides 165 + 1 = 2% + 1 = 641 x 6700417 
(decomposition into primes given by Euler), then p — 40 or 418776, 
again absurd. 

If the first condition is not verified, if w is a primitive sixteenth 
root of 1 modulo q, since tw, tw?, u^ tw are all the primitive 
sixteenth roots of 1 modulo q, and since w8 = —1 (mod q) there exist 
i,j,0<i<j <7, such that wi = Lat 36d q). This leads to 
one of the following congruences, with all possible sign combinations: 


(i) w = +2 (mod q) 491,25. 
(ii) w = w +1 (mod q); 
(iii) w? = +w à 1 (mod q); 
(iv) mi = w +1 e q); and 
(v) wt = xu? +1 (mod q). 
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We discuss the various possibilities. 

(i) Raising to the eighth power leads to q | 257 or q | 255 = 
3 x 5 x 17, an absurdity. 

(ii) Raising to the square leads to wt = —w (mod q), which is 
impossible, or wł = 3w +2 (mod q). Squaring again, this leads to 
—] = 9u? 4- 12w +4 (mod q), and substituting w? = w +1 (mod q) 
we obtain 3w = —2 (mod q). Hence wt = 3w +2 = 0 (mod q), 
which is impossible. 

(iii) Raising to the cube: 


—w = xu? + 3w° +3w+1 (mod q) 


and substituting, 


—w=wtl4+3w?+3w+1 (mod q). 


According to the choice of the sign, we have 3w? + 5w +2 = 0 
(mod q) or 3w? — w = 0 (mod q). 

In the first case, multiplying with w and substituting, we obtain 
5w? + 5w +3 = 0 (mod q); subtracting, 2u? = —1 (mod q) and 
therefore —2* = 1 (mod q), so q | 17, impossible. If 3w = 1 (mod q), 
then 3w? = —3w + 3 = 2 (mod q) so raising to the cube, —3*w = 8 
(mod q), hence q divides 17, an absurdity. 

(iv) Squaring: —1 = w? + 2w + 1 (mod q), so w° = F2w — 2 
(mod q), hence w +1 = wt = 4w? + 8w +4 (mod q). This gives, 
according to the choice of sign, 


and subtracting, 
= { ter (mod q). 


So w^ = w + 1 = —4 (mod q) and raising to the square, —1 = 16 
(mod q) so q | 17, impossible. 

(v) Raising to the square, —1 = w* + 2u? + 1 (mod q) and 
substituting wt = +w? +1 (mod q), in all cases we obtain w? = +1 


(mod q), which is impossible. 


Case 4: Let q = 10p +1. 
If p = a? (mod q), by the above method 10p + 1 = q divides 
(10° + 1)(10° — 1). If q divides 10? + 1 = 100001 = 11 x 9091 (this 
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last number is a prime) then p = 909, an absurdity. If q divides 
10° — 1 = 99999 = 3? x 41 x 271 then p = 4 or 27, an absurdity. 

If the first condition is not verified, if w is a primitive tenth root 
of 1 modulo q, then there exist i,j, 0 < i € j < 4, such that wi = 
w +1 (mod q). The above conditions lead to either one of the 
following congruences: 

(i) w? = +2 (mod q) (j = 1,2,3,4); 
(ii) w = w +1 (mod q); and 
(iii) wt = w +1 (mod q). 


We discuss the various possibilities. 

(i) Raising to the fifth power leads to q | 31 or q | 33 = 3 x 11, 
impossible. 

(ii) We have 


3w +2 
—4 


w =w? +2w+1=w+1 2w+1={ (mod q). 
The second case is not possible. In the first case multiplying with 
w and substituting —1 = 3w? + 2w = 3w + 3 + 2w (mod q) so 
5w = —4 (mod q). Multiplying with the previous congruence, —5 = 
—12w — 8 (mod q) so 12w = —3 (mod q) and 4w = —1 (mod q) 
hence subtracting, w = —3 (mod q) and therefore q | 11, absurd. 

(iii) Multiplying with w: —1 = w? +w (mod q), so w? = Fw—1 
(mod q) and this was considered in case (ii). 


Case 5: Let q = 14p + 1. 

If p = a? (mod q), we see in the same way that q divides 147 + 
1 or 147 — 1. But 147 +1 = 105413505 = 3 x 5 x 7027567 (this 
last number is a prime). Then p = 501969, an absurdity since this 
number is a multiple of 3. Also, 147—1 = 105413 503 = 13x8 108731 
(this last number is a prime). Then p = 579195, which is absurd. 

If the first condition is not verified, if w is a primitive fourteenth 
root of 1 modulo q, then there exist à Æ j, 0 < i, j € 13, such that 
wi =w'+1 (mod q). Since w” = —1 (mod q), the above conditions 
lead to either one of the following congruences: 


(i) u? = +2 (mod q) (j = 1,2,... ,6); 
(ii) w? = w +1 (mod q); and 
(iii) w? = +w +1 (mod q). 


We discuss the various cases. 
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+27 (mod q), so q | 127 or 129 = 3 x 43, which gives 
is was excluded. 
ii) w?’=w+1 (mod q) gives 


wi=w?+2wt+l=wtl ESTE et? (mod q), 


the second case being impossible. Squaring, 
—w=9w?+12w+4=9w+9+12w+4 (mod q), 


so 22w=—13 (mod q). Then 22u? — 22 = —13 (mod q) so 22w?=9 
(mod q). Then —13w=9 (mod q) and from this we obtain 9wz —4 


(mod q), —4w = 5 (mod q), 5w = 1 (mod q); so 25 = —20w = —4 
(mod q), hence q | 29, which is impossible. 
(ii) w? = +w +1 (mod q) gives, to the cube: 
—w? = +w? + 3w? + 3w +1 


= +w + 1+3w°+3w +1 = 3w? + dw +2 (mod q), 


hence 2w?+2w+1= 0 (mod q) so 2(4w+1)+2w?+w =0 (mod q) 
and 

0 (mod q), 

0 (mod q), 


N N 


2w? — w 


{ 2w? + 3w + 


hence 


—3w + 0 (mod q), 


which is impossible. 


{ cm (mod 4), 
) 


and this gives w = +1 (mod q 


With this criterion, Legendre had actually shown that the first 
case of Fermat’s theorem holds for every prime exponent p < 197. 
Indeed, for each such prime p, there exists a prime q = 2kp+1, with 
2k € (2,4,8,10,14, 16). On the other hand, 38 x 197 + 1 = 7487 is 
a prime, but 2k x 197 + 1 is not a prime if 2k < 38, 6 J//2k. 

The limitation in Legendre's results was due to the size of the 
numbers involved. For example, to test whether p — 197 is not a pth 
power modulo q — 7487, would lead to find whether 7487 divides 
387? + 1. Maillet extended Legendre's result in 1897, pushing the 
limit up to p — 211. 

Mirimanoff used a method involving Bernoulli numbers, in 1905, 
to extend the results to 257. 
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In 1908, Dickson published two papers in which he explored Legen- 
dre’s ideas and with a more careful analysis involving congruences, 
he showed that the first case of Fermat’s theorem holds for every 
prime exponent p < 7000 (with the exception of p = 6857, which he 
did not take the trouble to examine). See also Maillet’s comments 
(1908). 

More progress along this line was made by Krasner (1940), Dénes 
(1951), and Rivoire (1968). 

For the primes p such that 6p + 1 or 12p + 1 is also a prime, the 
method of proof breaks down and does not lead to any conclusion. It 
should be noted that in 1974 Gandhi announced without proof that 
if p and 6p+1 are primes then the first case holds for p; since no proof 
has ever been published, there is reason to doubt of the justification 
of the statement. In this connection, we quote the paper by Granville 
and Powell (1988). 

An interesting, but very difficult question is whether there exist 
infinitely many primes p such that 2p + 1 (or 4p + 1, or 8p + 1, 
etc...) is also a prime. We discuss this problem in the Appendix to 
this chapter. 

We still note here the following result of Vandiver (1926) : 


(1F) Let p and q = 2kp +1 be odd primes (with k > 1). If 
2k = 2"p^, where h > 0 and p does not divide v, and if 2 is not a 
pth power modulo q, then condition (2) above is satisfied. 


PROOF. We show (2^). If (2k)?* = 1 (mod q) or equivalently if = 1 
(mod p), then 2?^" = 22*eg2kh — (Wp)? = (2k)?* = 1 (mod q). 
Since p does not divide v, there exist integers a,b such that av — 
1+ bp. Then 


1 = 92kva = 9 (1+bp) 2k = EE = Ech = 92k (mod q). 
If g is a primitive root modulo q and 2 = g? (mod q), then 1 = 


2?* = g?** (mod q). So q — 1 = 2kp divides 2ks, hence s = ps’ and 
2 = (g*)? (mod q), which is a contradiction. 


Using (1B), Vandiver deduced in 1926 the following result which 
however had been proved by Wendt in 1894, using his form of Sophie 
Germain's theorem: 
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(1G)  Ifp and q = 2kp +1 are odd primes, with 2k = 2"p^, h > 0 
and v not divisible by p, if the congruence X? +Y?” + Z" = 0 (mod q) 
has only the trivial solution, then the first case of Fermat's theorem 
holds for the exponent p. 


PROOF. By (1B) and (1F) it suffices to show that 2 is not a pth 
power modulo q. If 2 = a? (mod q) then a? + (C1)? + (-1)? = 
2 4- (—1) + (71) = 0 (mod q) contrary to the hypothesis. 


Sophie Germain's theorem, corollaries and variations were redis- 
covered by several authors. In 1953, Thébault proved: 


(1H) If m > 2 is an integer such that 2m + 1 is a prime, if 
there exist pairwise relatively prime nonzero integers x,y,z such that 
x + y" = z" then 2m + 1 divides xyz. 


PROOF. If 2m + 1 does not divide x then by Fermat’s little theorem 
z?" = 1 (mod 2m + 1) hence z^ = +1 (mod 2m + 1). 
Similarly 


y" = +1 (mod 2m + 1), 
z" = +1 (mod 2m +1), 


hence 0 = z" 4- y™ — 27^ = (-E1) - (+1) — (41) (mod 2m 4 1), which 
is impossible. 


This same result (even with the further hypothesis that m be 
prime) is proved again by Stone in 1963 and Gandhi in 1966, in the 
same journal! 

Gandhi showed, also in 1965, a result similar to Thébault's: 


(11) Ifm> 2 is an integer such that Am--1 is a prime, if x,y, z are 
nonzero pairwise relatively prime integers such that zim + y" = z" 
then 4m + 1 divides xyz. 


PROOF. If m = 3, the statement is trivially true, by Chapter I, $4. 

Let m > 3 and assume that 4m + 1 does not divide xyz. From 
x" +y” = z" it follows that z?"' 4- y?" 4-254" = z?". Since 4m 4-1 
is a prime not dividing x then by Fermat's little theorem zim = 1 
(mod 4m + 1), so z?" = +1 (mod 4m +1). Similarly 


y^" =+1 (mod 4m +1) 
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and 


2? = +1 (mod 4m +1). 


Hence +1 +1 + 2z"y" = +1 (mod 4m + 1) so 2a™y™ = +1 or +3 
(mod 4m + 1) and +4 = Jumm = | or 9 (mod 4m + 1). This 
implies that 4m + 1 = 3,5, or 13, hence m = 3, which is à contra- 
diction. 


This same result (with the further hypothesis that m is prime) is 
proved again by Gandhi in 1966 and 1970 and by Christilles in 1967. 
Perisastri (1969) proved: 


(1J) fp > 51 is a prime such that 8p+1 is also a prime, if x,y, z 
are nonzero pairwise relatively prime integers such that x? + y? = z? 
then 8p + 1 divides xyz. 


(1K) Ifm>3 is an integer such that 3m+1 is a prime, if x,y,z 


are pairwise nonzero relatively prime integers such that x" + y” = 
z™, then 3m+1 divides xyz. 


Krishnasastri and Perisastri proved in 1965: 


(1L) ` nie an odd prime, if x,y,z are integers such that x? +y? = 
z?” and p does not divide xz, then there exists an integer k > 1 such 
that 1+ kp divides z. 


Combining (1C) with Sophie Germain’s theorem, we have (see 
Stone (1963), Perisastri (1968)): 


(1M) Let p and 2p + 1 be odd primes. If x,y,z are nonzero, 
pairwise relatively prime integers such that x? + y? + z? = 0, then p? 
divides one (and only one) of the integers x, y, z. 


PROOF. By Sophie Germain’s theorem we may assume, for example, 
that p divides z. By (1C), p? divides z. 


Pomey obtained in 1923 and 1925, with similar methods, several 
sufficient conditions for the first case of Fermat's theorem for the 
prime exponent p: 
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(1N) Let p be an odd prime and assume that either one of the 
following conditions is satisfied: 
(a) p=1 (mod 4) and 2p + 1 divides 2? +1. 
(b) p=3 (mod 4) and 2p + 1 divides 2? — 1. 
(c) 4p +1 divides 27? +1. 
(d) 4p -- 1 2 5 (mod 12) and 4p + 1 divides 3°? +1. 
) 
) 


8p + 1 divides 24? — 1. 
10p +1 divides 5°? — 1. 


Then the first case of Fermat’s theorem is true for the exponent p. 


All the above results do not suffice to conclude that there exist 
infinitely many prime exponents p for which the first case of Fermat's 
theorem is true. T'his was first proved, with analytical methods, in 
1985 by Adleman and Heath-Brown and and by Fouvry. 

Earlier, in 1897, studying the class group of the cyclotomic field, 
Maillet showed that for every odd prime p there exists an exponent 
e (depending on p) such that the first case of Fermat's theorem is 
true for the exponent p*. In particular, this implied the existence of 
an infinite set of pairwise relatively prime exponents for which the 
first case is true. This last statement was proved again by Kapferer 
in 1964. His proof was not elementary, since it used the theorems of 
Furtwängler, as generalized by Moriya (requiring class field theory). 
In 1978, Powell discovered independently the following very simple 
proof: 


(10) 
(1) If p is any odd prime, n = p(p—1)/2 = 2"m where u > 0, m 
is odd, if x,y,z are nonzero integers such that x" --y"^--z^ = 0 
then gcd(m, xyz) £ 1. 
(2) There exists an infinite set of pairwise relatively prime expo- 
nents for which the first case of Fermat’s theorem is true. 


PROOF. (1) If p = 3 then n = 3 and the hypothesis is not satis- 
fied. Let p > 3. Suppose that gcd(m, xyz) = 1. Then p J xyz so 
z(-)/? = +1 (mod p) and z^ = +1 (mod p). Similarly y^ = +1 
(mod p), z^ = +1 (mod p), hence x” + y" + z" #0 (mod p) and a 
fortiori, x" + y" + 2” z 0. 

(2) Assume that n;,... , n; are pairwise relatively prime expo- 
nents for which the first case of Fermat's theorem is true. Consider 
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the arithmetic progression (—1 + 4nin2---ngt | t = 0,1,2,...). By 
Dirichlet’s theorem on primes in arithmetic progressions there exists 
an odd prime p such that p = —1 (mod Ann ---nx). Let nun = 
p(p — 1)/2, so ny41 is odd. Since ged (p(p — 1)/2, (p + 1)/2) = 1 
then gcd(ng41, Ni tenk) = 1. By (1), the first case is true for the 


exponent Hr, and this suffices to complete the proof. 


1823 


1879 


1894 


1897 


1905 


1908 


1908 


1908 


1910 


1923 


1925 
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IV.2. Wendt's Theorem 


Wendt indicated in 1894 a determinantal criterion for the existence 
of a nontrivial solution of Fermat's congruence 


(2.1) X? -- Y? + ER =0 (mod q), 


where p,q are distinct odd primes. 

To begin, we wish to exclude from our discussion the following 
trivial case; it also holds without assuming the exponent in (2.1) to 
be a prime: 


(2A) Ifq is an odd prime, if n > 1 is such that gcd(n, q —1) = 1 
then there exist integers x,y,z, not multiples of q, such that z" + 
y" +z” =0 (mod q). 


PROOF. By hypothesis, gcd(n, q — 1) = 1, so there exist integers a, b 
such that an + b(q — 1) = 1. Let zo, yo, zo be integers, not multiples 
of q, such that £o + yo + zo = 0 (mod q). Then 


xa” = £o (mod q), 
= yo (mod q), 
= zu (mod q), 


so (x8)" + (y@)” + (z2)" =0 (mod q). 


In particular, if n = p is a prime not dividing q — 1 then (2.1) has 
a nontrivial solution. 
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Wendt’s criterion is expressed in terms of the circulant of binomial 
coefficients. More generally, let n > 1 and let £; = cos2mi/n + 
J/—Isin2ri/n (for i = 0,1,...,n — 1) be the n nth roots of 1; so 
£o = 1. The circulant of the n-tuple (ao, a1,... , a, 1) of complex 
numbers a; is, by definition, the determinant of the matrix 


gu Gy ... Qni 
(2.2) ae An-1 Qo ... Qn-2 
ou Q3 ... ` On 
We denote it by Circ(ao, &1,... ,, gu 1). The circulant is expressed 


in terms of nth roots of 1 and equally as the resultant of two poly- 
nomials (see Chapter II, $4). Spottiswoode (1853) and also Stern 
(1871) and Muir (1920) proved: 


LEMMA 2.1. Let 0,01,... , An-1 € K, let G(X) = Qo + a, X ++ 
an-ı X”! and let & = 1,£1,...,€:-1 be the nth roots of 1. The 
circulant of ao,d1,... ,@n—1 is equal to 


n—1 


Circ(ao, à4,... , 04.1) SE (€;) = Res(G( X), X” — 1) 


(where Res denotes the resultant). 


PROOF. Let 
010 - 0 
0 0 1 : 0 
A= 
100 -œ 0 


(n x n matrix), so I, A, A?,... , A”! are distinct and A” = J. As is 
easily seen, 


C = aol + aA + a24? +- H an AE. 


The characteristic polynomial of A is det(XA — I) = X” — 1. Since 
it has distinct roots £o = 1,£1,... ,£»_1, then À is diagonalizable, 
that is, there exists an invertible matrix U (with complex entries) 
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such that 
£o 0 0 
0 
UAU- = & 
0 0 oo 
Hence 
G(f) 0 0 
0 G m 0 
LUCE = : z 2 : D 
0 0 Ges G(En—1) 


where G(X) = agta,X+-+-+an_-1X"~'. So Circ(ao, ou... ,@n—1) = 
det(C) = det(UCU~) = TU" G(&). By Chapter II, (4B), we also 
have Circ(ao, gu... , a4) = Res(G(X), X” — 1). 


The following result of Wendt is about the circulant of binomial 
coefficients. Accordingly, for every n > 1 we define the Wendt de- 
terminant to be 


(2.3) weise). (3) LGS 


IEG(X) = 1 ()X- X84 (DANS = (LEX) - X" then by 
the lemma, W, = Res(G(X), X^ — 1) = ID [(1 + £)^ — 1]. Now 


we give Wendt's criterion (1894); see also Matthews (1895), Bang 
(1935), and an expository presentation by Rivoire (1968): 


(2B) Let p be an odd prime and assume that q = 2kp + 1 (with 
k > 1) is also a prime. Then there exist integers x,y,z, not multiples 
of q, such that x? + y? 4- z? = 0 (mod q) if and only if q divides Wap. 


PROOF. By (1A), Fermat's congruence X? + Y? + Z’ = 0 (mod q) 
has a nontrivial solution if and only if the system of congruences 


X?* =1 (mod q), 
{ (X +1)* =1 (mod o), 


has a nontrivial solution, or equivalently, the system of congruences 


X?* —1 =0 (mod q), 
(X + 1)?* — X?* =0 (mod q), 
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has a nontrivial solution. This holds exactly when the resultant of 
the polynomials X?* — 1 and G(X) = (X +1)* —- X* = 1+ DIR + 
C7) X? + +--+ (25) X?*-! is congruent to 0 modulo q; by Lemma 
2.1 this means that W2, = 0 (mod q). 


Now we turn our attention to the computation of Wendt's deter- 
minant 


(2.4) = TI I4 ex. 


The following result was stated without proof by Wendt (1894), and 
proved thereafter also by Matthews (1895), E. Lehmer (1935), Bang 
(1935), and Frame (1980): 


(2C)  W, — 0 if and only if 6 divides n. 


PROOF. Assume that 6 divides n, and let E = £i = cos2r/n + 
V=Tsin2r/n. Let | = n/3, so w = EI is a primitive cubic root of 1. 
Hence 1 +w +w? = 0, therefore 1 + £' = —£2! and (1-- £)” = 1. We 
conclude that W,, = 0. 

Conversely, if W, = 0 there exists j such that (1 + £j)" = 1, 
so j, 1 +6; are nth roots of 1 and since the triangle with vertices 
0, 1, 1 E £5 


£j 1+ & 


0 1 


is equilateral, then 6 = 27/6 (or 0 = —27/6) and 1+; is a primitive 
sixth root of 1. But (1 + £;)" = 1 hence 6 divides n. 


As a corollary: 


(2D) If p and 6mp +1 = q are primes then the congruence 
X? + Y? + Z? =0 (mod q) has a nontrivial solution. 


PROOF. This follows at once from (2B) and (2C). 
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In view of (2C), it is customary to modify the definition of the 
Wendt determinant when 6 divides n, by putting 


w, = Toi 
i=0 


where 

(LK +1)" — xX” 
X?4 X41 ` 

In 1935, E. Lehmer indicated, without proof: 


(2.5) G(X) = 


(2E) Ifd divides n then W, divides Wn. 


PROOF. We may assume that W4 Z 0, i.e., 6 Yd. We note that since 
d | n then each dth root of 1 is also an nth root of 1. From W, 4 0, 
then 


Wn (1+6) -1 


— = [=x I] ([G+6)"-1] 
Wa ei (1 T 6) -1 Elia 

If n = de then 
WEE 


UL E pec e Da 


Therefore W, /W/ is an algebraic integer, but also a rational number, 
hence a rational integer. 


The next property was also indicated by E. Lehmer, without proof. 
A proof (not the one below) appears in Frame's paper (1980); the 
weaker statement 2" — 1 divides W,, was proved by Bang (1935): 


(2F) Ifn>1 then W, = (—1)"-! (2^ — 1)u?, where u is an integer. 


PROOF. 


n—1 


W, = Deere 


j=0 


where € = cos 2r /n + v—1sin2r/n. Thus W, has the factor 2” — 1 
(when j = 0) and if n is even, also the factor —1 (when j = n/2). 
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So 
LE e EEN HIE E 


j#0,n/2 

Let u = Tocjensal(1 + £)” — 1]. We note that for every j 4 0, n/2 
we have (1+ €-4)"—1 = [€-4(1+ £/)]" —1 = (14+ &)"—1 so wis real 
and W,, = (—1)""'(2” — 1)u?, and it remains to show that u € Z. 
Let o be any automorphism of the field Q(£), so o(€) = EI where 
1 € «€ n and gcd(l,n) = 1. Hence c [(1 + £)” — 1] = (1 + £^)" 

where 0 < k < n/2 and jl = +k (mod n) because (1 + £^ ^)^ — 1 = 
(12-£*)^ —1. If0 < j, j' < n/2, let k, k' be such that 0 < k, k' < n/2 
and jl = +k (mod n), j'l = +k’ (mod n). We note that if j Z 7’ 
then k Z k’, because if jl = +j'l (mod n) then j = +)’ (mod n) 
and this would imply j — j'. Therefore, different factors of u have 
distinct images by c, so c(u) = u hence u € Q, being invariant by 
the automorphism of Q(£). But u is also an algebraic integer, hence 
u EZ. 


For n even, Frame proved (1980): 


(2G) Ifn=2m and 3 |n then W, = —3((2" — 1)/3)?u8 where 
u is an integer. In particular, if p is a prime, p = 5 (mod 6), then 
W,-1 = —3((2?-1 — 1)/3)?u8, where u is an integer. 


PROOF. Let n = 2m. Since 3 Yn, p = &? is also a primitive nth root 
of 1. From p™ = —1 by (2.4) we have 


2m-—1 2m—1 : 
= II II (1 + om" + pr), 
j=0 k=0 


If j = m we have Te) (2 ent = 2?" — 1, similarly if k = 
m we have [kč (2 + p™t/) = 2?" — 1, while if j = k we have 
LÉ !(1— 297) = 1-22", Discounting the repetition of factors with 
equal summands, and noting that for j = k = m we have the factor 
3, then W, = az — 1)/3)?v where 


H ; 
v= II (1 + pti + p™t*) 


(IT indicates the product for all (j,k) such that 0 < j, k < 2m — 
1, j z m, k £ m, and j 7 k). 
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The geometric mean of 1, p", p™** is £J** (because p = £?); 
dividing each factor in the above product by the geometric mean of 
the summands, we get 


is De“ Ee E TUR + EKARI), 
The first product is equal to 


II II ich = II £)@m—2) II ch 


Zeien km,kzj jzm kzm,kzj 
2m—1 
= [nE JI ch 
jzm j=0 
= Mene detre Dam 
jem 
jem 
2m—1 
= (Sa) I] pom 
j=0 


cap Ure pore dme 


== EE CI —1. 


The second product is equal to [| (£* + €/ + £?) (where JJ” is the 
product for all (e, f, g) such that 0 € e, f, g < 2m — 1, e, f, g are 
distinct and e + f + g = 0 (mod 2m)). Indeed, since j Z k, j # 
m, k z m, and 3 Ym, letting e, f, g be such that 0 € e, f, g € 2m— 1, 
and 


f 2S m 2j — k (mod 2m), 


| e = —j — k (mod 2m), 
g 2 m- 2k — j (mod 2m), 


then e, f, g are distinct and e+ f + g = 0 (mod 2m). 
Conversely, for every triple (e, f, g) as indicated, let j,k be such 
that 0 < j,k € 2m — 1, and 


jaf-e-m (mod 2m), 
kzg-—e- m (mod 2m), 
so j £m, k Z m, and j z k. 


Let u = TT (£* + Ef + £9) (where TTT is the product for all (e, f, g) 
such that 0 € e « f « g € 2m — 1 and e+ f +g 2 0 (mod 2m)). 
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We show that u is invariant by every automorphism oc of Q(£). 
If o(f) = EI, with gcd(l,n) = 1, 1 < l< n, then o(£* + & + £9?) = 
ET + EN + £9! which is a factor of u. If (e', f',g') Æ (e, f, g) then 
el, fl, gl cannot be congruent modulo 2m to e'l, f'l, g/l, respectively, 
or to these numbers in any order. So different factors of u have 
different images by o, hence o(u) = u, showing that the algebraic 
integer u is rational, hence u € Z. But v = u because each factor 
EC HET --£? of v is equal to a factor £^ +E% -- €? of u, where (e', f’, g') 
is obtained by permutation of (e, f, g). 

Hence W, = —3((2" — 1)/3)?u8. 


Before the next result we need to establish a lemma. 


LEMMA 2.2. Let n be such that 2n +1 = p is a prime, let € = 
cos 27 /n + /—1sin27/n, and let P be any prime ideal of the cyclo- 
tomic field Q(£) which divides p, i.e., PAZ = Zp. Then there exists 
a primitive root s modulo p such that € = s? (mod P). 


PROOF. Indeed, let g be any primitive root modulo p, i.e., g mod- 
ulo p has order p — 1 — 2n, so g? modulo p has order n, and the 
set of elements modulo p with order n is (g? (mod p) | 1 € j < 
n, gcd(j,n) = 1}. If1 < j,k < n, gcd(j,n) = gcd(k,n) = 1, and 
j z k then g” z g?* (mod p). By Chapter II, (3F), $,(g?) = 0 
(mod p) for all such exponents j. On the other hand, ®„(&) = 0 
so $,(£7) = 0 (mod P) for all j, 1 < j < n, ged(j,n) = 1. Hence 
there exists j such that € = g (mod P), and we just take s to be 
s = kg) (mod p), 1 € s < p. 


Frame also proved the next statement (while Bang had noted in 
1935 that p | W,,): 


(2H)  If2n41- p is a prime then ri HIT divides W,. 


PROOF. To begin, let € = cos2r/n + /—1sin2-/n. Given u,v, 
1<u,v<n, gcd(u,n) = gcd(v, n) = 1, for every j, 1 € j < n, such 
that gcd(j, n) = 1 let f; = 1 — E — £v. 

Let duo = I Lata) D, We show that d € Z. Indeed, let o be 
any automorphism of Q(£), so o(£) = EI with gcd(l,n) = 1. Then 
c (1— Ein — £j") = 1 — £*" — Ei where 1 € k < n and k = jl (mod n), 
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ie. c(f;) = fx. Every factor f, of d,, is so obtained, therefore c 
defines a permutation of the set of factors f; of d, Thus 


T(duv) = Il o(f;) = II Í; = oru 


ged(j,n)=1 ged(j,n)=1 


This proves that d,, € Q and since du, is an algebraic integer, then 
du» € Z. If P is any prime ideal of Q(£) such that PN Z = Zp, if s 
is a primitive root modulo p such that £ = s? (mod P) then 


duv = II (1 — sin — s) (mod P), 


gcd(jn)=1 
SO 
du» = II (1 — 8" — 57%) (mod p). 
gcd(jn)=1 
For every h, 2 < h € p — 2, and such that h? 4 —1 (mod p) in 
case p = 1 (mod 4), we define ap, bn,0 < an, b, < p — 1 by 


{ an = 2h/(h? +1) (mod p), 
bn = —(h? — 1)/(h? +1) (mod p). 


Then a; Æ 0,1 (mod p) and by, Æ 0,1 (mod p). Hence there exist 
u = Up, v = v, such that 1 < u, v € p — 1 and 


an = s" (mod p), 
b, = s" (mod p). 


We have a? +b? = 1 (mod p) so 1 — s?" — s?" =0 (mod p), therefore 
p divides duw- 

If h, h/ with 2 < h, h/ € p—2 are such that h? # —1 (mod p), h^? # 
—1 (mod p) when p = 1 (mod 4), and if h = +h or +h (mod p), 
where hh = 1 (mod p) then it is easy to check that ay = tay, 
(mod p) or ay = +b, (mod p) while by = +b, (mod p) or by = 
+a, (mod p), respectively; hence, with obvious notations, 1 — s?" — 
s? = |] — s?v — s?" Conversely, if h, / are such that this equal- 
ity holds then either a, = +a, (mod p), bj = +by (mod p) or 
an = +b, (mod p), bj = +aw (mod p). Examining all possible 
cases, this leads to h’ = +h or +h (mod p). 

If p Z 1 (mod 4), the number of possible values for h is p — 3 = 
2n — 2; this yields [(2n — 2)/4] = |(n — 1)/2] factors 1 — s?" — s?" 
which are multiples of p. 
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If p = 1 (mod 4), the number of possible values for h is p — 5 = 
2n — 4; this yields [(2n — 4)/4] = [(n — 1)/2] (because n is even) 
factors 1 — s?" — s?" which are multiples of p. 

Thus, in all cases, p^-9/2 divides TL. d, Let Wn = [lu v dut 
where t € Z (since t is an algebraic integer and t € Q); thus in" II 
divides W,. 


For example, 47!! divides W:3 and 101?* divides Ws. 

The next divisibility result concerns Lucas numbers. For the con- 
venience of the reader we shall recall some relevant facts about Fi- 
bonacci and Lucas numbers (see also Chapter V, $3). 

The Fibonacci numbers F, (n > 0) are defined as follows: 


Fo = 0, F = 1, 
and for n > 2: 
B= Frs + Fo. 


Similarly, the Lucas numbers L, (n > 0) are defined as follows: 


and for n > 2: 
Ly = Du + L3. 
Let a, 3 be the roots of the polynomial X? — X — 1, so 
1+ V5 
2 
_1- 
"7 2 


= 1.6180... , 


B 


= —0.6180..., 


and 
a 4 B — 1, a — B = V5, o = —1. 


As is known, a is called the golden number (or golden ratio). 
The following lemma is attributed to Binet (1843): 


LEMMA 2.3. For every n > 0: 
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PROOF. We have o? = a + 1, 8? = 8 +1, so a" = a"! + a"? and 

also 8" = 9"-! + 8"? (for n > 2). 
Let T, = (a" — 8")/(a — B), so To = 0, Ti = 1 and 

an berg o"? — gn? a” — gn 

The d TIS = ü 

1 2 ENT + ee = 

(for n > 2). Thus, the sequence {T, | n > 0} coincides with the 

Fibonacci sequence. Similarly, let U, = a" + 8", so Uo = 2, U, = 

1 and U, 14 -U, 3 =Q” 148” Liq 245" Se a + 8" = U (for 

n > 2). Thus, the sequence {U,, | n > 0} coincides with the Lucas 

sequence. 


For more results about Fibonacci and Lucas numbers, see, for ex- 
ample, the books by Vorob'ev (1961), Hoggatt (1969) or Ribenboim 
(1995). 

Frame proved: 


(21) Ifn is odd then L? divides Wn. 


PROOF. By (2C) we may assume that 3 /n. We have 


n—1 


Ha-g- = - Ta - eoa - ee) 


| = -(1- ein 6") 
= 14 (a +6") - (-1)^ 
mp. 


since n is odd, and using the preceding lemma. Similarly, LE (1 — 
ER EF) = La. Now we note that if 1 < j, k < n—1, then the pairs (j 
mod n, 2j mod n) and (2k mod n,k mod n) are distinct. Indeed, 
otherwise j = 2k (mod n) and 2j = k (mod n), hence 3j = 3k 
(mod n) and j = —k (mod n). Therefore j = k and j = n — k, so 
n — 2k is even, contrary to the hypothesis. 

This shows that W, = Liv where v € Q, and v is an algebraic 
integer, so v € Z. 


For example, the squares of the Lucas numbers 


L4; = 6643838 879 
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and 
Ls, = 119218 851 371 


(which are known to be primes), divide W47, W53, respectively. 

Improving on a result of Lubelski (1935) and independently of 
Bang (1935), E. Lehmer indicated in 1935 the following divisibility 
property of the Wendt determinant: 


(2J)  Ifp is an odd prime then p?~?(2?-1 — 1)/p divides W, 4. 


PROOF. Consider the matrix C whose determinant is W,_1: 
meee ney eats) 
1 2 p—2 
KS 
1 p—3 
Ec Goa) E 
p-3 p-2 pcd 


e) (2) 6)» 


Adding every column of C to its last column, we obtain a matrix C" 
whose last column has all its elements equal to 


px pe pe E 
1 ged Eh = 1, 


Adding to each column of C" (up to the column p — 3) the next 
column, we obtain a matrix C" such that the elements of the first 
p — 3 columns are of the form 


ree 


for k —0,1,...,p —2. These elements are all multiples of p. Thus 
W,-1 = det C" is a multiple of p?73(2?71 — 1) = pr-2(2P-121)/p. 


For a recent related result, see Helou (1997). 
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Frame observed experimentally that if n < 50 and 6 Vn then 


log; [Wal — n? logio c| < 0.33, 


where 


2 n/3 
logc — =] log(2 cos 0)d0, 
T JO 


so log, c = 0.140305. Thus W, has about 0.1403 n? digits. The 
knowledge of the size of W,, and of some prime factors of W,, may 
guarantee that the factorization of W,, is already complete. 

In 1982, Boyd showed that the sequence {A,,}, with 


An = logio [Wh] — n? logis c, 


is bounded and has exactly three limit points 0, 210g; 3, logy, 2, 


1 


corresponding, respectively, to n = 


H 


2 or 3 (mod 6). 


The values of W,, (for n = 2k > 20) in the table below have been 


kindly provided to me by J.S. Frame. 
Wendt determinant: 

W, = 1, 

E 

= 98-22? x7, 

= —375 =—3 x 53, 

= 3751 = 11? x 31, 

= 0, 

= 26 x 29? x 127, 

= —37 x 53 x17, 

= 2? x 7 319 x 37? x 73, 

= —3 x 11°? x 31, 

= 23° x 67? x 89 x 1992, 

= 0 


= 
I 


= —2?4 x 3 x 296 x 433 x 1273, 


ao © e Ww N ka o 
| 


= —3" x 53 x 7° x 17" x 2573, 


ana NN © 
Hog dog 


EE 


N 


x 4499 x 65537°, 


= —3 x 5% x 11°? x 313 x 41° x 61, 

= —3 x 23?! x 67° x 89? x 199° x 6833, 

= —3” x 53° x 798 x 131 x 521? x 2731? x 81915, 

= —260 x 3 x 58 x 136 x 29?! x 433 x 1139 x 127? x 197°, 
= —3" x 5 x 76 x 1715 x 47° x 971? x 193° x 257° x 353° 


= 36 x 53? x 79° x 131? x 521° x 8191, 


= 2! x 7 x 11? x 317 x 61f x 151 x 271°, 
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Was = —3 x 103!? x 13712 x 2399 x 3076 x 4099 x 613° x 3571° 
x 43691? x 131071, 

Was = —3 x 76 x 19112 x 22912 x 4196 x 4576 x 6476 x 7615 
x 14839 x 93499 x 174763? x 524287*, 

Wao = —3!9° x 5°4 x 1115 x 31° x 413° x 241° x 2811? x 641° x 881° 
x 10485773, 

Was = —3 x 53 x 2318 x 676 x 89?! x 1996 x 397? x 6179 x 10136 
x 2113? x 23336 x 32576 x 32576 x A95 TA x 15709, 

Was = —3 x 47^% x 13912 x 4619 x 5991? x 6911? x 8296 x 11519 
x 23476 x 43575 x 178481? x 27962033, 

Wso = —3 x 115 x 10174 x 15118 x 251? x 4019 x 601? x 11511? 
x 13019 x 16019 x 19519 x 38519 x 46519 x 58019 x 6101° 
x 44561? x 558313, 

Wz = —3?5 x 53 x 5351 x 796 x 1319 x 157^ x 233° x 313 
x 521° x 677° x 13019 x 1613? x 2731? x 67099 x 8191? 
x 134175 x 205936. 


In 1991, Fee and Granville computed the factors of (the modified) 
W,, for every even n < 200 (including when 6 divides n). From these 
calculations, it followed: 


(2K)  Ifk «100, if p and 2kp -- 1 are primes, then the first case 
of Fermat’s last theorem is true for the exponent p. 


PROOF. The result follows from the explicit determination of the 
factors of W24 (for 2k < 200) and the verification that the conditions 
of (2B) are satisfied. 


From the values of Wa (for 2k = 2,4,8,16) it follows at once 
that Fermat’s congruence X? + Y? + Z? = 0 (mod q), where p and 
q = 2kp + 1 are odd primes, has only the trivial solution. However 
X? + Y? + Z? =0 (mod 31) and X? + Y? + Z? =0 (mod 43) have 
nontrivial solution, because 31 | Wio and 43 | Wu. 

According to a result of Dickson (1909) (see Chapter X, (2C)), if 
p,q are primes and q > (p— 1)?(p— 2)? + 6p — 2 then the congruence 
(2.1) has a nontrivial solution. Therefore, if q = 2kp + 1 and 6 J//2k 
then by (2B) q | Wag. 

We conclude this section by referring to statements made by Gan- 
dhi (1975, 1976) if the first case of Fermat's theorem is false for 
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p. These conditions are expressed in terms of appropriate circulant 
determinants, however the proofs were never published, due to the 
untimely death of Gandhi. 


1843 


1853 


1871 


1894 


1895 


1909 


1909 


1910 


1920 


1935 


1935 


1935 


1961 


1968 
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IV.3. Appendix: Sophie Germain’s Primes 


We have mentioned that it is a very difficult problem to know whether 
there exist infinitely many primes p such that 2p + 1 (or 4p + 1, or 


8p + 1, 


etc ...) is also a prime. A heuristic argument points to the 


validity of a much more general statement, as we shall explain. 


If x 


is any positive number, we denote by m(x) the number of 


primes p such that p < x. The famous prime number theorem of 
Hadamard and de la Vallée Poussin (1899) states that 


n(a) 


im 
z>% pz/logz  ^' 
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which we write also as m(x) ~ x/log x. 

Dirichlet’s theorem states that if a,m are relatively prime (pos- 
itive) integers then there exist infinitely many primes in the arith- 
metic progression {a + km | k > 0). If 7,,,(x) denotes the number 
of primes p in this arithmetic progression and such that p < x, then 


that is, 

1 x 
v(m) logz 

Considering the polynomial f(X) = mX + a, aan LI) represents 
the number of integers n, 1 € n € (x — a)/m such that f(n) is a 
prime. 

More generally, we may consider the following situation. Let 
f(X), fo(X), ... , LUX) be polynomials with integral coefficients, 
and positive leading coefficient. Let d; > 1 be the degree of f;( X). 
Assume moreover that these polynomials are irreducible over Q and 
that none is a constant multiple of another. Let N be any positive 
integer and let Q(N) = Qj... 5, UN) denote the number of integers 
n,1<n<N, such that fi(n), fa(n),... , f(n) are primes. 

The probability that a large positive integer m be a prime is 
n (m) /m ~ 1/log m, by virtue of the prime number theorem. 

Since we shall be interested in the values of the polynomials f; (.X ), 
fo(X),... , fa X), we have to discount the fact that k-tuples of such 
values are not randomly distributed. 

If p is an arbitrary prime, let s, denote the chance that none of 
the integers of a random k-tuple be divisible by p. Then 


—1\* 1\* 
EE 
p p 
Similarly, let r, denote the chance that for a random integer n, none 


of the integers fi(n), fa(n),... , Frin) be divisible by p. If w(p) de- 
notes the number of solutions of the congruence 


AX) fo(X) +++ fs(X) = 0 (mod p), 


then r, = (p — w(p))/p = 1 — w(p)/p. It may be shown that the 
product [],, rp/sp is convergent, say to a limit C = C(fi,... , fx). If 
we agree that this number measures the extent to which the values 


Ta,m (x) (Dé 
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of fi(X),... , fi (CX) form a nonrandom k-tuple, then the probability 
that fi(n), fa(n),... fr(n) are all primes (for large n) is equal to 


1 


log fi(n) ++ - log fi(n) 
1 1 
AS C( fis. hr, S (log n)*" 


CL fuss: SEH 


since log f;(n) ~ d;logn. Then 


1 x I1 
QW) ~ Cin f) 331 2. 


k —, (log n)* i 


In particular limyn>œ Q(N) = oo. Thus, we see in a heuristic way 
that there should exist an infinite number of primes p with the re- 
quired properties. 
Now we consider the following special cases: 
(1) A(X) = X, fo(X) = 2X+; and 
(2) A(X) =X, fo(X) = X +2. 
Case (1) deals with Sophie Germain’s primes while case (2) refers 
to twin primes. In both cases w(2) = 1 and w(p) = 2 if p > 2. Hence 
the constant is 


2 
tal pu 

C= Z II P , = 13203236, 
(1- 3) p>2 EE 


SO 


N 

N) ~ 1.3203236 -i 

This expression had been conjectured by Hardy and Littlewood 

in 1923, for the count of twin primes less than N. It agrees rather 

closely with the actual number of twin primes; see Sexton (1954), 
Wrench (1961), Shanks (1962), and Brent (1975). 
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V 
Interludes 5 and 6 


In this chapter, we give more background material. 


V.1. p-Adic Numbers 


A. The Field of p-Adic Numbers. In order to study divisibil- 
ity properties by a prime p, it is often convenient to consider the 
development of integers in the base p: 


a = ag + ap + ++ + amp”, 


with 0 < a; < p — 1, p" < a < p", 

Hensel also introduced infinite p-adic developments. The new 
numbers so defined are the p-adic integers. He described the op- 
erations of addition and multiplication among the p-adic integers 
and proved a very important theorem concerning the existence of 
p-adic integers which are roots of certain polynomials. 

The p-adic numbers may be considered as being limits of sequences 
of integers, relative to the p-adic distance. These considerations al- 
lowed the introduction of methods of analysis in the study of divisi- 
bility properties. 
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We shall describe here very briefly the concepts of p-adic numbers 
and the results which we shall require to discuss Fermat’s equation. 

In Chapter II, 81, we have defined, for every prime p, the p-adic 
valuation v, of Q. We recall that the set of p-integral rational num- 
bers is 


Zp = [5 | a,b Z,b 0 p b, ged(a,b) = 1} U {0} 


It is a subring of Q containing Z. Moreover, if r € Q then v,(r) > 0 
if and only if r € Zp. 

Zp is called the valuation ring of vp. The only maximal ideal of 
Zp is equal to Z,p = {a/b € Zp | p divides a}. The field Z,/Z,p is 
isomorphic with the field F, (with p elements); it is called the residue 
field of vp. 

The valuation v, defines, on the field Q, the function d, given as 
follows. If a,b € Q, then 


d,(a,b) = p- when a z b, 
d, (a, a) = 0. 
It is easy to verify the following properties: 
d,(a,b) > 
d,(a,b) = i a), 
d,(a,b) = d,(a — b, 0), 
d, (a 4- c,b +c) = d, (a,b), 
d,(a,b) < max(d, (a, c), d,(b, c)}, 
d, (a; c) + dp(b ? c): 
So d, is a distance, compatible with the operation of addition. It is 
called the p-adic distance of Q. 
The completion of Q relative to the p-adic distance is again a field, 
denoted Q, and called the field of p-adic numbers. The nonzero 
elements a of Q, are represented by p-adic developments 


oo 

H 

a= > iP , 
i—m 


with 0 < a; € p— 1, m € Z, and am £ 0. If an = Y ip" 
(with n > m) then a = lim, a, (the limit is relative to the p-adic 
distance). 


< 
< 


V.1. p-Adic Numbers 145 


The p-adic valuation may be extended by continuity to a valuation 
Ôp of the field Qp, which is defined by 


Op (x 7) =m_ (ifam #0). 


Thus, the values of à, are also integers or infinity. 

The topological closure of Z, in the field Q, is a ring, denoted by 
Z. Its elements are called the p-adic integers. Thus a € Ô, is a p- 
adic integer exactly when &,(a) > 0. It is also clear that Z,UQ = Zig: 
The only nonzero prime ideal of Zs is Z p, consisting of the multiples 
of p. The residue field of ô, is Za /L,p, which is isomorphic to the 
field F,. 

fa, pE Q,, we say that a divides G if there exists y € Te such 
that ay = f; this means that Gol < 6,(8). a € Ê, is a unit in 
Z, when a divides 1, i.e., &(a) = 0. The set U, of units of Z, is a 
multiplicative group. 

If a, 6,7 € Qp, y £9, we write a = p (mod y) if y divides a — f. 
Similarly, if y € Q,, y # 0 and F(X), G(X) e Q,[X] we write 
F(X) = G(X) (mod y) when y divides each coefficient of F(X) — 
G(X). 

These congruence relations satisfy the usual properties of congru- 
ences of integers. 


B. Polynomials with p-Adic Coefficients. We discuss briefly 
polynomials with coefficients in the field Q,. 

If f(X) = X” + a,X" +... ca, € Q,[X], we make the 
definition &,(f) = mino<i<n{v,p(a;)}. If f, g € Q,[X], with g 4 0, we 
define 

5a ES M 

i (7) =a- 9). 
which is well defined. Then &, is a valuation of the field Q,(X), 
whose restriction to Q, is the valuation v,. For simplicity, we shall 
write vp, instead of ùp. 

If fig € Q,[X], we write f = g (mod p") when v,(f — g) > n, 
or equivalently, p" divides each coefficient of f — g. For every f = 
Doux € Z,[X], we denote by f = f mod p the polynomial 
Sox e EX. 
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We recall now some well-known facts about polynomials in Q,[X]. 
The polynomial f € Se is said to be primitive when v,(f) = 0. 
Every polynomial f € Z,[X] may be written as f = afi, where 
a € Z, fie Z,[X] and f; is primitive. 


(LA) Gauss’ Lemma. /f f,g € EE are primitive polynomials, 
then f -g is also a primitive polynomial. 


(B) Iff € Z,[X] is primitive and f = g- h with g,h € Q,[X], 
then f = fi: gi for some primitive polynomials fı, gı € Z |X], such 
that deg o = deg g, deg hı = deg h. 


The nonconstant polynomial f € Z,[X] (respectively, f € Q,[X]) 
is irreducible in Z,[X] (respectively, in Q,[X]) if it is impossible to 
write f = g-h, with g, h nonconstant polynomials in Z,[X] (respec- 
tively, Q,[X]). 


(C) IIe SL then f is irreducible in KEN if and only if it 
is irreducible in Q,[X]. 


The nonconstant polynomials f,g € ZX | are said to be relatively 
prime whenever, if h € Z |X] and h divides f and g, then deg(h) = 0. 


(D) Iff € Z,[X] is nonconstant and primitive, and if f does 
not divide the nonconstant polynomial g € Z |X], then f and g are 
relatively prime. 


(E)  Iff,g € Z,[X] are nonconstant and relatively prime, then 
there exist polynomials s,t € Z,[X] such that s: f+t-g is a nonzero 
element of Zp. 


(AF) fffgkhe Z,[X], if f is irreducible and if f divides g - h, 
then either f divides g or f divides h. 


(1G) Ifg,he EN are nonconstant and relatively prime, if g or 
h is primitive and both g and h divide f, then g - h divides f. 
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(1H) Every nonzero polynomial f € 2X], may be written as 
a product f = agi-::9m, where a € De; and g1,... ,Qn € KEN 
are primitive irreducible polynomials with n 2 0. Moreover, a and 
Di, Ou are uniquely defined up to a unit in Zp. 


Now we shall consider the resultant and the discriminant of poly- 
nomials in Z,[X]. 


(11) Let f,g be nonconstant polynomials in Z,|X]. Then the 
following conditions are equivalent: 
(a) There exists a nonconstant polynomial h € Z, X] which di- 
vides both f and g. A 
(b) There exist nonzero polynomials fı,gı € Z [X] such that 


deg( fi) < deg( f), deg(gi) < deg(g), and 


n:ft+fieg =O. 


PROOF. (a) = (b) We assume that f = h- fı and g = —h- o 
with P. ou € Z,[X]; then dest bh) < deg(f), deg(gi) < deg(g), and 
o f+heg=0. 

(b) = (a) Conversely, we assume that there exist polynomials 
f; and oe Seet such that deg(fi) < degt f), deg(gi) < deg(g), 
and gif + fig = 0. If f,g are relatively prime, then by (1E) there 
would exist polynomials s,t € 2% such that s-f+t-g=ce 
Zi, with c Z 0. Eliminating g from the above relations, we obtain 
f(sfi— tg1) = cf; where deg(fi) < degt f), which is impossible. 


(19) In order that f = 37^ ,a,X"^* and g = 37; 9 b; X” (where 
m,n > 0, and f,g € Z,[X]) have a common nonconstant factor, it 
is necessary and sufficient that R(f,g) = 0. 


PROOF. It was seen in Chapter II, (4B), that if f, g have a common 
nonconstant factor, then R(f, oi = 0. 

Conversely, by (11) it is equivalent to show the existence of nonzero 
polynomials f; € Z,[X],g. € Z,[X], fi = Do aX", g = 
$5770 d; X"-1-*, such that gı - f + fı- g = 0 (it is not excluded that 
Co = do = 0). This relation is equivalent to the following system of 
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m+n equations in the unknown quantities Co,... , C4 3, do, ... , dm—1 
(obtained by equating to zero the coefficients of the powers of X): 


doao + Cobo = 0, 
doa, + diao + Cody + C19 = 0, 
doaz + dia + dan + Cob2 + Cb, + Cobo = 0, 


This homogeneous linear system has a nontrivial solution in Q, if 
and only if the determinant of its matrix vanishes, or equivalently, 
the determinant of the matrix obtained after exchanging rows and 
columns vanishes; in other words, R(f,g) = 0. Now we finish the 
proof by noticing that if there exists a nontrivial solution in Q,, 
by multiplying by the common denominator of these elements, we 
obtain a nontrivial solution in Lp: 


(1K) Let f,g € Z [X] be relatively prime nonconstant polynomials, 
such that v,(R(f,g)) = p. Then every nonzero polynomial h € ZX" 
such that v,(h) > p and deg(h) « deg(f)--deg(g) may be written in a 
unique way ash = gy f+ fi: g, where fi, gı € Z4 XL Up( fi) > vy (h)— 
p, Up(gi) Z Up(h) — p, deg(fi) < deg(f), and deg(gi) < deg(g). 


PROOF. Let 
f = pro 4m" 
i=0 


Y bx 
i=0 


g = 
m--n-—1 
h= Y exte 
i=0 


We want to determine f, = 5: GM E SOT d; X^-1-* in 
Z,|X] such that h = gı- f + fi: g. Comparing the coefficients of X in 
both sides of the above relation, we obtain a linear system of m+n 
equations in the m+n unknown quantities c;, d;, whose determinant 
is exactly R(f, g). 

Since f, g are relatively prime, by (1J) we have R(f,g) 4 0; hence 
the above system has a unique solution. 
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The coefficients c;,d; may be computed by Cramer’s rule; their 
numerators are linear forms in the e; with coefficients in 2, (because 
fig € ox ]), and their denominators are equal to R(f,g). From 
vp(e;) > Up(h) > 0 = v, (R(f, all, it follows that v,(c;) > 0, v,(d;) > 


0, so that fi, g1 € Z,[X] and v (fi) > v(R) — p, Gell > v,(h) — 
p. 


(1L) ` Let g € Z,[X] be a nonconstant polynomial. In order that 
there exist a nonconstant polynomial g € Z |X] such that g? divides 
f, it is necessary and sufficient that Discr(f) = 0. 


PROOF. In fact, if g? divides f, then g divides f and P. hence 
Discr(f) = R(f, f) = 0. Conversely, if Discr(f) = 0, by (1J) there 
exists a nonconstant polynomial f € Z,[X] dividing f and f'; by 
(1H), we may assume that g is irreducible. We have f — g- h, hence 
fl = g'-h+g-h'; since g divides f’, it follows that g divides g’-h; from 
deg(g') < deg(g) we see that g does not divide g', hence g divides h 
(by (1F)), and so g? divides f. 


We shall now investigate the behavior of the resultant R(f,g) 
when f,g are replaced by sufficiently close polynomials, relative to 
the metric defined by the valuation v, on Q,( X). 


(1M) If fig, fum € EN are nonconstant polynomials and 
up(fi — f) 2 a, vlg — 9) 2 B, then v,(R(fi,9) — R(f.g)) 2 
min{a, 8). 


PROOF. Let a € Ê, be such that v,(a) = min(vy( fi — f),v,(g1 — 9)); 
then fı = f + ah and gı = g + ak, where h,g € Z,[X]. Thus 
R(fi,g1) = R(f ah, g--ak). Writing the eliminating matrix between 
f+ah, g+ak, and computing the determinant, we obtain R(f,g)+as, 
where s € Ly is a certain sum of products of elements equal to a or 
to coefficients of f,h,g,k. Thus 


vp R(fi, gi) — Rf, g)) 2 (a) 2 min{a, 8j. 


(1N) | With the above notations, if f, fı € Z,[X] are nonconstant 
polynomials and v,(f — fi) > a, then v,(Discr(f) — Discr(fi)) > a. 
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PROOF. In fact, since Discr(f) = R(f, f^), Discr(fi) = R(fi, fi’), 
we have only to remark that v,(f; — f) > o implies v,(fi' — D ) >a. 
Indeed, if m = max{deg(f),deg(fi)} and f = Xito ai X", fi = 
oio bi X", then 


m-1 


m-—1 
f = 5 0 (ee Fi zx 3 Im 1)b; d EL 
i=0 


i=0 
thus v,((m—1)(b;—a;)) > v,(b;—a;) for every i = 0,... , m—1, and so 
volfi — f") 2 min{u,(b;—a;) : i =0,...,m—-1} > u(fi— f) > a. 


We also have at once: 


(10) If fig € EN and f = f modp,g mod p, then the 
resultant of f and g (computed in F,[X]) is R(f,9) = R(f,g) and 
the discriminant of f (computed in F,[X]) is Discr(f) = Discr( f). 


We say that the monic nonconstant polynomials f,g € Z, [X] are 
relatively prime modulo p when f, g are relatively prime polynomials 
in F,[X]. Similarly, f is said to be irreducible modulo p whenever f 
is an irreducible polynomial of F,[X]. Every polynomial f € Lp [X] 
is congruent modulo p to a product of polynomials in EN ], which 
are irreducible modulo p, and are uniquely defined modulo p. 

With these definitions, we have: 


(P) figeE Z, [X] are relatively prime modulo p if and only if the 
resultant is a unit in Zp, i.e., if and only if v,(R(f,g)) = 0. 


PROOF. By definition, f,g are relatively prime modulo p when f,g 
are relatively prime polynomials in F,[X]; by Chapter II, (4B), this 
means that R(f,g) 4 0; by (10), this is equivalent to R(f,g) Z 0, 
that is, p does not divide R(f, g), or equivalently, v, (R(f, g)) = 0. 


(1Q) Let f, g € Z |X] be irreducible polynomials modulo p. Then 
p divides R(f,g) if and only if f = g (mod p). 


PROOF. In fact, p divides R(f, g) exactly when f, g are not relatively 
prime modulo p; hence there exists a nonconstant polynomial À € 
Zp|X] such that f = h-fı (mod p), g = h-gı (mod p); by hypothesis, 
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we must have f = h (mod p), g = h (mod p), hence f = g (mod p). 
'The converse is trivial. 


We observe that if f = $5774 € Z,[X] and H is a multiple of p, 
then f = Y;-oam X” (mod p). Indeed, the coefficients of f' are 
jaj; thus if p divides f’, then p divides a; when p does not divide j. 
Therefore, 


f= » ona" (mod p). 


i>0 


In particular, if f is irreducible modulo p, then f’ 4 0 and hence we 
may consider the discriminant of f modulo p. 


(AR) Ire Z,[X] is irreducible modulo p, then p does not divide 
Discr(f). 


PROOF. We write f = f1+pf where all the coefficients of fı are not 
multiples of p. Then f’ = f;'+pf.’ and by (1N), Discr(f) = Discr( fi) 
mod p. If p divides Discr( f), then p divides Discr(f,) = R(fi, fi’). 
By (1P), there exists h € Z,[X] such that À is nonconstant and h 
is a common factor of Pichi Thus f, = h-9, rox = h-k with 
g,k € Z,[X]. Since f, = f is irreducible, then g = € with c € Z,. So 
[d F = f,-k. Therefore 


deg(f,') < dent A) = deg(F,) € deg(f, Ei = deg(f, ). 


which is absurd. 


(S) Let f € Z,[X] be such that f is not constant. Then f has a 
multiple irreducible factor modulo p if and only if p divides Discr( f). 


PROOF. We have f = g1gs::: gn (mod p), where gi, go, ... , gn are 
irreducible modulo p. Hence, by (1M) and Chapter II, (4D), 


Discr(f) = Discr(gige : +: gn) 
ee Il Discr(g;) - J [[R(9:; g;)]? (mod p). 


By (1R), p does not divide Discr(g;) for 1 < à € n. Then p divides 
Discr( f) if and only if there exist indices i < j such that p divides 
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R(g:,g;). By (1Q) this means that g; = g; (mod p), concluding the 
proof. 


C. Hensel's Lemma. This very important result, proved by Hensel 
in 1908, is certainly the raison d'étre of the p-adic numbers. It as- 
serts the existence, under appropriate conditions, of p-adic roots of 
polynomials. We shall prove here Hensel's lemma in its strong form: 


(AT) Let F,g,h € Z,[X] be such that: 

(i) deg(g) = m > 0, deg(h) = n > 0, deg(F) = m+n, g is 

monic and degt E — gh) < deg(F); 

(i) v, (R(g, R)) = p > 0; and 

(iii) v, (F — gh) = a > 2p. 
Then there exist G, H € È |X] such that v,(G — g) > a — p, v,(H — 
h) > a — p, deg(G) = deg(g), deg(H) = deg(h), G is monic, H,h 
have the same leading coefficient, and finally, F = G- H. 


PROOF. We shall prove the following assertion, for 7 > 0: 

(*) Ifg,h € 2,[X], deg(g) = m, deg(h) = n, g is monic, deg(F — 
gh) < deg(F), v,(F — gh) > a+ j, and v,(R(g, h)) = p, then 
there exist polynomials g*,h* € Z IX], such that deg(g*) « 
m, deg(h*) < n, v,(g*) > a +j — p, Up(h*) > a + j — p, and 
Up (F — (g+ g')(h - h') 2a j-1. 

Indeed, since v,(R(g,h)) = p then R(g, h) Z 0. By (1J), g and h 
are relatively prime. 

We note that v,(F — gh) > a+ jap and deg(F — gh) < deg(F) = 
deg(g) + deg(h), and it follows from (1K) that there exist uniquely 
defined polynomials g*, hr € Z,[X], such that the following hold: 


F — gh = h*g + g*h, 

Og) > Ug = gh) =p 2 AF ip; 
vp(h*) > GT — gh) — pora 3-9 
deg(g*) < deg(g), 
deg(h*) < deg(h). 
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Therefore 
v,(F — (g + g')(h + h*)) = v((F — gh) — (h*g + g'h) — g'h*) 
= v(g*h*) 
> 2(a +j- p) 
= (o — 2p) + (a + 2j) 
>a+j+l. 


We apply this result, beginning with g = go, h = ho, j > 0, and 
letting gı = go + gj, hı = ho + hj. Then we apply the result for 
gı, hı, j = 1, obtaining go = gı + gj, ha = hı + hj and so on. 

We have to note that 


Up(R(gj41, hj+1)) = vp(R(g;,h;)) =p 


for every j > 0, because 


Up [R(g;+1; hj+1) — R(gj h;)] 2 min{vp(g*), (V) 
2 a+ J — p> Pp, 


as follows from (1M). 

Thus, we have the sequences of polynomials (9g;);»o and (h;)j>0 
such that deg(g;) = m, deg(h;) = n, each g; is monic, hj and h have 
the same leading coefficient, and finally 


Up(Gj41 — 93) 2 & +j- p, 
Up(hj41 — hj) 2 a j — p. 


Thus (9;);>0 and (h;);>0 are Cauchy sequences of polynomials of 
degree m,n, respectively. This means that if 


the sequences (b;;);>0, (ci;);>0 (for every i), are Cauchy sequences 
in Qp. Since Q, is complete, let b; = lim b;;, c; = limc;;, and G = 
Xio OX", H = amb 
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Then v,(G — g;) > a+ j — p, because 


G-gj-— Jim ES 29 
i=0 

and 

Up EN zung =p, 

i=0 
for every s > 1. Similarly, v,(H — hj) > a+ j — p for every j > 0. 
Finally, 
UF — GH) = v,[(F — g;h;) + (g; — G)H + g;(h; — HI 
min{v,(F — 9jh;), “plg; — G) + v,(H), 
vy(g;) + v(h; — H)} 

2 a+ J =P, 


V 


for every 7 > 0. It follows that F = GH with G monic, and H,h 
having the same leading coefficient. 


Now we give Hensel’s lemma in its more customary form: 


(1U) Let F,g,h € Z,[X] be such that: 

(i) deg(g) = m > 0, deg(h) = n > 0, deg(F) = m+n, g is 

monic and degt E — gh) < deg(F); 

(ii) g, h are relatively prime modulo p; and 

(iii) F = g-h (mod p). 
Then there exist polynomials G, H € Z,|X] such that G = g (mod p), 
H = h (mod p), deg(G) = deg(g), deg(H) = deg(h), G is monic, 
H, h have the same leading coefficient, and F = G H. 


PROOF. This is an immediate corollary of the preceding result. In- 
deed, by (1P), v,(R(g,h)) = 0. Since u(F — gh) > 1, the above 
result may be applied. O 


Another commonly encountered form of Hensel's lemma concerns 
the lifting of roots modulo p. 


(1V) Let F € Z,[X] with deg(F) > 1, let a € Z, be a simple root 
of the congruence F(X) =0 (mod p). Then there exists b € Z, such 
that b — à and F(b) — 0. 
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PROOF. By hypothesis, F = (X — a)h (mod p) where h(A) # 0 
(mod p). So X — a,h are relatively prime modulo p. By (1U), 


F = GH, with G monic, deg(G) = 1, G = X — a, thus G = X —b 
with b € Z,, b = a, and therefore F(b) = 0. 


We shall apply Hensel's lemma to the polynomial X?^! — 1: 


(1W) [fp is a prime then Zs contains p — 1 (p—1)th roots of 
1. More precisely, for every j = 1,2,... ,p — 1 there exists a unique 
element wj € Zp such that we — 1 and wj = j (mod p). 


PROOF. For every j = 1,...,p—1, j^ ! 1 (mod p, so X^ !-1z 
IPS GE — j) (mod p). Thus 1,2,... ,p — 1 are all the roots of the 
congruence, and they are simple. By (1V), for every j there exists 
w; € Z, such that wet = 1 and w; = j (mod p). 

For the uniqueness, we observe that if w € Dies we -—1,andwz 
k (mod p), then w must coincide with one of the roots of X?~' — 1, 
say w = wj; then j = w; =w =k (mod p), so j = k, i.e., w = wp. 


Let p be a prime and let (Z/p)* denote the multiplicative group 
of nonzero residue classes modulo p. Let Q denote the multiplicative 
group of (p — 1)th roots of unity in Z,. 

As a corollary, we have: 


(1X) The mapping which associates to each nonzero residue 
class j modulo p the (p — 1)th root of unity wj in 2 such that 
wj = j (mod p), establishes an isomorphism between the multiplica- 
tive groups (Z/p)* and Q. Moreover, wg is a generator of Q if and 
only if g is a primitive root modulo p. 


PROOF. Indeed, if 1 < j,k,h < p —1 and jk = h (mod p), by (1W) 
it follows that wjw, = wy (mod p). Since v; = j (mod p), the map- 
ping j modulo p — w; is an isomorphism. The last assertion is 
trivial. 


(1Y) With the above notations: 
(1) iF P —1 yr then Y eaw" = 0; and 
(2) ifp—1|r then P eow" =p= 1, 
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PROOF. (1) Let g be a primitive root modulo p, so w, is a generator 
of the multiplicative group Q. Then 


p-2 — q= Ur 
Doro g = E 
g T= wr 
wea j=0 g 


when p—1/r. 
(2) Ifp—1|r then w = 1 for every w € Q, hence gw” = 
p—l. 
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V.2. Linear Recurring Sequences of Second Order 


Let A, B be nonzero integers such that D = A? —4B #0. Let Ro, R, 
be given integers and for every k > 2 let Ry = AR, 41 — B Ry. 3. 

If A = 1, B = —1 and Ro = 0, Ri = 1, then Ry is the kth 
Fibonacci number. If A = 1, B = —1 and Ro = 2, Rı = 1, then 
Ry is the kth Lucas number. These sequences of numbers have been 
briefly considered in Chapter IV, $2. 

We shall now indicate several elementary properties of the se- 
quence of numbers (R;),>0. For our purpose, we shall assume that 
Ro = 0 and R, = 1. There are analogous results when Ro = 2, Ry = 
A. 

Part (1) of the following lemma was given by Siebeck (1846); it 
is a generalization of Binet's result for Fibonacci numbers (Chapter 
IV, Lemma 2.3). In 1878, Lucas published a classical paper on this 
subject; see also Lehmer (1930). For further references on recurring 
sequences, see Dickson (1920, Vol. I, pp. 393-411). 
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LEMMA 2.1. With above notations and hypotheses: 


(1) If o, are the roots of the equation X? — AX + B=0, that 
1s, 
A +VA?-4B À-—VA2-4B 
a A 


2 B = 2 , 


then for every k 2 0: 
ak — Ob 
a-p 
(2) If k, h > 1 then Ryin = Ry Ryu = BRR. 
(3) (o^ + B")? = DR? + AB^. 
(5) AR, + (a* + 8^) 2 2R,44 for h > 0, 
AR, — (a^ + 8^) =2BR,_, forh > 1. 
(6) If k > 1 is odd then 


GD2/ 4 
9k-1m == ae EN 
ks À ls + d 


h=0 


R; = 


ee e 
241 (ak + BR) = y | avo. 


h=0 


(7) If k 2 1 is odd and n > 1 then 


(k—1)/2 
k(k—h-—1 
R, = p*-0? pk Ke 

nt 2. nie oes 


| p^ ps-2^-1)/2 pk-2h. 
PROOF. (1) From o? — Aa+ = 0 it follows that a*t? = Aa*t! — 
Dok and similarly 8^*? = Ai) — BBY. Noting that a Z 0 since 
A? — 4B #0, by subtraction and division by a — 3, we obtain 


ob? — gk? 7 or! gini ak — Bk 


a — B a — 0 a — 0 
Let R, = (a* — 8*)/(a — B) for every k > 0, so Ri. = AR 
BR, for k > 0. Since the sequences (Rp)k>0, LE Jun satisfy the 


same recurrence relation and Rj = Ro = 0, R = Ri = 1, then 
Ry = Ri, = (aë — B*)/(a—k) for every k > 0. 
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(2) 
Fay Fui — BR a Fs 
7 (ok — B*)(a^^1 — g*) — aB(ak-! — 8*-1)(a^ — BP) 
P (a — 8p 


kth _ gk+h 


SE ee — Fin. 


o — p 


(o^ + 8^? — DR? = o^ 429^ gh + g^ el 


o^ + 29^ g^ Ba uen = (o?^ er 2a" g^ EE Bu 
4a” 8” = 4B", 
since (a — 8)? = D, a8 = B. 
(4) Leth = nk. If n = 1 it is trivial and we proceed by induction: 


Rentiyk = RnkRk+i = DR Ry 


is a multiple of Rx. 
(5) We have A = a + B, so by (1), 


AR, + (o^ + 8 


8 - p” hi gh 
= (a+ HE 5 (e +8") 
M qr ni. Caer E eee are eer 
= "m 8 
` QRh+1 
= PNEU E " : x 2 Ry. 
Similarly, 
AR), ES (a^ + e 
E zy hy ah 
= (a+ HE —8 - (a" +6”) 
" o^ e c ft — gt 1— aß” — aB + gent 
= = 5 
z E re BA S 2BR, 
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(6) Let k > 1 be odd. From 
2a=A+VD, 
28 — A— VD, 
by raising to the power k, and subtracting, 
nomea [mom 
c \2h+1 
Dividing by 2(a — 8) = 2VD, we have 
k-1 es k k-2h—1 ph 
Sou I "en D”. 


Similarly, by raising the expressions of 2a and 28 to the power k, 
by subtracting and dividing by 2, we obtain 


(k-1)/2 f y 
24-1 rra + 8^) - 5 (5) AF-?^ ph. 


h=0 
(7) By Chapter VII, (1D), we have the identity 


(k-1)/2 
,k(k-h-1 
k k 

Y) 
ROVE SREY M qus v SE 


h=1 
We take X = a”, Y = — 8”, and since k is odd, 
GES SE ha 


| X*Y”(X TY) ^7 


a^ ph (an-gye Y zu Jeme p 


Dividing by a — 3, noting that aß = B and a — 8 = VD then by 
(1) we have 


(k—1)/2 


Rx = D&- 0/2 RR + > 


k(k—-h—1 B^ pt 2h 1)/2 pk 2h. 
h\ n-1 5 


Now we investigate the divisibility properties of the terms of the 
recurring sequence (R;)x>0. 

If m > 1 and if there exists an index k such that m divides R;, 
we denote by r(m) the smallest such index. It is called the rank of 
appearance of m in the sequence (Rx)x>0- 
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LEMMA 2.2. (1) If m > 1 and gcd(m, B) = 1 then there exists 

the rank of appearance of m. 

(2) If m > 1 and gcd(m, B) = 1 then m divides Ryp if and only 
if r(m) divides k. 

(3) If m, n > 1, ged(m, B) = gcd(n, B) = gcd(m,n) = 1 then 
r(mn) = lem{r(m),r(n)}. 

(4) If p is an odd prime, p Y B, then R, = (D/r) (mod p) (where 
(D/p) denotes the Jacobi symbol). 

(5) If p is an odd prime, p | B, then r(p) divides p — (D/p). 

(6) If p is an odd prime, p Y B, if k = vp(Rr)) and e > k, then 
r(p*) = p*^*r(p). 

(T) Ifk 2 1,h x 1then Reng = Ri. (mod RP). 

(8 Ifk21,h 21 then Ry, = kR, Rý] (mod R2). 


PROOF. (1) Consider the set of pairs ((R; mod m, R;_1 mod m | 
k= 1, 2, ... . Since there exist only finitely many couples of residue 
classes modulo m, there exist integers k,l, with k < l such that 


R, = R (mod m), 
Ry, = Rp (mod m). 


Since BR,» = AR, ,— R; then BRp-2 = BRy_2 (mod m) hence 
Ry-2 = Ry_2 (mod m), because ged(m, B) = 1. Repeating this 
argument, it follows that 0 = Ry = R)_, (mod m), so there exists 
an integer l— k > 1 such that m | R,_,. Hence there exists a smallest 
integer r(m) such that m | Rec). 

(2) First we note that for every k > 2, if m divides R then 
gcd(m, Rx_1) = 1. Otherwise there exists a prime p dividing m, Ru 
and R;_1. From Rx = AR, ,; — BR,» and gcd(m, B) = 1, it follows 
that p | Ry». Repeating the argument, we would conclude that 
p | R4 = 1, which is a contradiction. 

Now let S be the set of all indices k such that m | Rg; by (1), 
S #0. We show that if k, h € S then k -- h € S. Indeed, by Lemma 
2.1(2), m | k+h. 

Similarly, if kj h € S, k < h, then h — k € S. In fact, Ra = 
Jun = Ry Ry ka = D Ry 4 Ry, hence m | B Ry 4, Ry as. But 
gcd(m, B) = 1 and, as shown above, gcd(m, R;_1) = 1, hence m | 
Ry as. 

This suffices to show that S is the set of multiples of its smallest 
element, namely r(m). 


V.2. Linear Recurring Sequences of Second Order 161 


(3) We have m | mn and mn | Rus so m | Rrimn) hence 
r(m) | r(mn); similarly r(n) | r(mn) hence | = Iem{r(m),r(n)} 
divides r(mn). 

Conversely, m | Ream) and r(m) | l hence m | Ry; similarly n | Ry, 
hence from gcd(m, n) = 1 then mn | Ri, that is, r(mn) | I. 

(4) By Chapter II, (3.3), we have 


Ry = = (a — By + pf = De + pf. 


Since D®-))/? = (D/p) (mod p) then R, = (2) (mod p). 

(5) If£p| D then (D/p) = 0 and R, =0 (mod p). Let (D/p) = 1; 
we show that R, ; = 0 (mod p) hence by (2), r(p) | p — 1. Indeed, 
by Lemma 2.1(5), and by (4) above, 


2BR, ,-— AR, — (o? + 6?)  A— (o? +a?) (mod p). 


By Lemma 2.1(6), 2?! (a? + 3?) = A? (mod p). Hence 


2BR,, = PBR, = PTA- AZ A- A? Z0 (mod p). 
Since p / B, p £ 2, then R,_1 =0 (mod p). 
Now let (D/p) = —1; we show that R,,1 = 0 (mod p), hence by 
(2), r(p) | p +1. Indeed, by Lemma 2.1(5), and by (4) above, 
2R +1 = AR, + (o? + 6?) = —A + (aP + 6") (mod p). 
By Lemma 2.1(6), 2?~'(a? + 8") = A? (mod p), hence 


2R,,,52?R,,,2 —2' At AP =—-A+ A? =0 (mod p). 


Since p # 2 then Rp+ı = 0 (mod p). 
(6) Let Rr) = p*t, with p Yt, k > 1. By Lemma 2.1(7), we have 
for m > 1, 


Rag, = DO "Zu 


(p™—1)/2 

prp fpe r(p)h ry(p" —2h—1)/2 pp" —2h 

e am CL h-1 E d Be» 
h=1 


For h = (p" — 1)/2 the summand is equal to p" B'(007-0/2 Rt, 
and its p-adic value is m + k, since p Y B. 
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If 0 < h < (p" — 1)/2 then 


p". [p^ —h —1) zr ry" —2h—1)/2 pp" -2h 
(Enea 


p" ppr- m 
> (nt) =m + on - 89s. 
However, if m > 1 and 0 < h < (p™ — 1)/2 then p" > 2h -- v (h) +1. 
Indeed, 2h = p" — p?r with p Yr, 0 € s < m and if s = 0 then r > 1; 
so 2h +v,(h) +1 2 p" — p?r - s4-l « p" because s +1 < p?r (since 
p # 2). Hence, 


m+ (p" — 2h)k — v,(h) > m + v(h)(k — 1) - k 2 m4 k. 
We have also 
pM > p^km-k, 


when m > 1 (because p Z 2). Thus, for m > 1 we have v, (E,(,») = 
m + k. This is also true when m = 0. 

Taking m = e — k > 0 then v, (E,(5),.—) = e so p° | R;(ppe-x and 
therefore, by (2), r(p°) | r(p)p®~*. Since p° | Rre) then r(p) | r(p*). 
Hence r(p°) = r(p)p" with 0 € m < e— k. If m < e—k then 
Gel Russe lz M++ k < e, so p° Y Rrp)pm hence r(p*) Z r(p)p" and 
this shows that r(p^) = r(p)p**. 

(7) The proof is by induction, being trivial when k = 1. We have 
by Lemma 2.1(2): 


Ju = Just 
= Renzi Ray — BR Ru 
= Reny Rh+ 
= REPU (mod R?) 


by induction and since R, divides Ry», (by Lemma 2.1(4)). 
(8) The proof is by induction, being trivial when k = 1. We 
have, by Part (7), by Lemma 2.1(2) and (4), and by induction: 
Retin = Renta = Rr Reny — BRr-1 Ren 
= R RE = BR kb Ri I 
= RAR Rng — BR] 
= R, R I(k + 1)Rr41 = (k +1)RaRi,, (mod Rj), 
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noting that 


1843 


1846 


1878 


1920 


1930 


1995 


Ru = R, Ro = RR, - = —BRj,4 (mod Ra). 
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VI 


Arithmetic Restrictions on 
Hypothetical Solutions 
and on the Exponent 


Let p be an odd prime and assume that x,y,z are nonzero pairwise 
relatively prime integers such that x? + y? + zP = 0. 

In this chapter we indicate congruences and divisibility properties 
satisfied by expressions involving the numbers z,y,z,p. In some 
instances, we will be able to reach a contradiction, proving that 
Fermat’s last theorem (or the first case) holds for certain exponents 
p. In Section 3 we focus on a conjecture of Abel, which has not yet 
been completely established by a direct proof. 


VI.1. Congruences 


Let p be an odd prime and assume that x, y, z are nonzero relatively 
prime integers such that x? + yP? + z? = 0. For easy reference, we 
recall results from Chapter ITI, 81. 

If p Y xyz then there exist nonzero integers r,s,t,r1, an, Du such 
that 


x+y=t, (a? + y)/(x + y) = t, z = tt, 
(L1) pear, (P +2)/(y+z)=r], ` worm 


z+x= sp, (2P +aP)/(z+x) = s], y= —-ss:, 
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p frstrisits and rs tr, $1, D are pairwise relatively prime. More- 
over rP + s? + t? £0. 

If p Y xy but p | z, then there exist nonzero integers r, s, t, r1, $1, t1, 
and n > 2 such that 


(1.2) 
z+y= pP, (x ty \/(et+y) =p, 2=—prth, 
ytz=r?, (y? + 2P)/(x + y) =r}, T --—rn, 
T+y= P, (2? +2P)/(x+y)= a, y= ss, 


p Y rstrisiti and r,s,t,7,,5,,1, are pairwise relatively prime. We 
have r? + s? + p?^—t? £ Q. 

Moreover: If p Y xyz then rı = 1 (mod 2p?), s, = 1 (mod 2p?), 
and tı = 1 (mod 2p°). If p | z, p Y zy then rı = 1 (mod 2p), sı =1 
(mod 2p), and tı = 1 (mod 2p?). If p /xyz then 


ge 
(1.3) y — —s? +k, 
z= —ť +k, 


where k = (r? + s? +4?)/2. If p |z, p yxy then 


Der +k, 
(1.4) y=- +k, 
z = —pP- HP + k, 


where k = (r? + s? + p?^-11»)/2. 

We begin with an easy congruence, soon to be reinforced. From 
x? = x (mod p), y? = y (mod p), z? = z (mod p) then —z = —z? = 
x? +YP = x+y (mod p), so z? +y? = —z? = (x 4- y)? (mod p?). 

The first result is due to Fleck (1909). It was given by Lind in a 
weaker form, in 1910. It was rediscovered by Frobenius in 1914 and 
again by Vandiver (1914), Pomey (1923) and Pérez-Cacho (1958). 


(LA) Letp be an odd prime, and let x,y,z be nonzero relatively 
prime integers, such that x? + y? + z? = 0. 

(1) If p does not divide x then x?! = 1 (mod p?). 

(2) If p /xyz then (x + y}? = x? + y? (mod nl, 
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PROOF. (1) Ist Case: p does not divide yz. As recalled above, 
ri = 1 (mod p°). Hence z = —rr; = —r (mod p?) and x? = —r? 
(mod p?). 

By symmetry, we have also y?” = —s? (mod p?) and z? = H 
(mod p?). Since à? + yP + 2? = 0 then r? + s? +t? =0 (mod p?). It 
follows from the above relations (1.3) that x = —r? (mod p?). Hence 
x? =x (mod p?) and z?^! = 1 (mod p?). 


2nd Case: Assume that p divides z, hence p does not divide y. 
As recalled above, tı = 1 (mod p°), hence t? = 1 (mod p?). 

On the other hand, pn — 1 > 4, so x = —y (mod p*) and pt? = 
(x? +y?) /(at+y) = x?! —aP ^y 4. —myP 7 ry" = pr”! (mod p*) 
hence x?! = t? = 1 (mod p?). 

(2 We have 0 = 2? + y? +2? = x + y + z (mod p?), hence 
x+y = -—z (mod pë). Then (x +y} = —z? = x? + y? (mod pf). 


With methods from Class Field Theory, as a consequence of the- 
orems of Furtwängler, Vandiver proved (1914, 1919): 


(1B) | With above notations: x? = x (mod p?), y? = y (mod p?), 
z? = z (mod p?), and x + y +z & 0 (mod pë). 


The result which follows is again due to Fleck (1909). Partial as- 
sertions were rediscovered by Pomey (1923), Vandiver (1925), James 
(1934), Niewiadomski (1938), and Inkeri (1946). 


(1C) Let x,y,z be nonzero relatively prime integers such that 
LP y? + 2? =O. 
(1) If p y xyz then x + y + z is a multiple of 6 and of rstp? and 
r+s+t is a multiple of p?. 
(2) Ifp | z then x+y+z is a multiple of 6 and of rstp? and r +s 
is a multiple of p, while r+ s+ t is not a multiple of p. 


Pnoor. It follows from the relations (1.1) (respectively, (1.2)), that 
in both cases r, s,£ divide x + y + z. Also, x + y + z is even and 
x? = x (mod 3), y? = y (mod 3), z? = z (mod 3), hence x+y+2 = 
x? + y? + z? = 0 (mod 3). 

In the first case, it follows from (1A) that x? = x (mod p?), uf = y 
(mod Hl, z? = z (mod p?), hence x + y +z = 0 (mod p?). Since 
p l/rst then rstp? divides x + y + z. 
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As it was recalled above, rı = 1 (mod p°), sı = 1 (mod p°), t, = 
1 (mod p?). It follows from relations (1.1) that x = —r (mod p?), 
y = —s (mod p°), z = —t (mod p°), hence r + s+t=0 (mod p°). 

Assuming that p | z, then p / zy, hence it follows from relations 
(1.2) that p" divides z and x + y (with n > 2), hence p? divides 
x+y +z; since p /rst then Hirst divides x + y + z. 

As recalled above, rı, = 1 (mod p), sı = 1 (mod p) so by relations 
(1.2), x = —r (mod p), y = —s (mod p). Hence r + s = —(xz +y) = 
z =0 (mod p). Thus r - s--tz tz 0 (mod p). 


The next proposition was proved, in the first case, by Spunar 
(1929) and James (1934); a simpler proof was given by Segal (1938). 


(1D) Let x,y,z be nonzero relatively prime integers such that 
L + y? + 2? =O. 
(1) If p /zyz then r - s -t z 0. 
(2) If p | z then r 4- s - p^t £0 and also r+ s - t ZZ 0 (where n 
was defined in (1.2)). 


PROOF. (1) Assume that p /xyz and r 4- s4- t — 0. Since gcd(r, s, t) 
= ] we may suppose, for example, that r,s are odd while t is even. 
Then r? + s? = (y+z)+(z+2) = z4y--2z = t? — 2tt, (by relations 
(1.1)). Hence 
PURO = RE 241 — t". 
r+s t 

The left-hand side is equal to r"^! — r?~?5+.---—rs?~? + s?^1, hence 
it is the sum of p odd numbers, so it is odd. On the other hand, 
2t, — t^! is even, which is impossible. Hence r + s +t Æ 0 in the 
first case. 

(2) Ifp |z then p /zy and by relations (1.2), r? +s? = (y + 2) + 
(zg+a2)=a+y4 22 = pH? — 2p'tt,. Hence, ifr+s+p"t = 0 
then 


IARE nf ues. Oty — mim, 

rcs prt 
Since p t; and n > 2 then p does not divide (r? + s?)/(r + s). On 
the other hand, by Chapter II, (3R)(4), gcd(p, r-- s) = ged(r--s, (r?+ 
s?)/(r--s)). By (1C), p | r+s, hence p divides (r?- s?)/(r4-s), which 
is a contradiction. From (1C), since p /r+s-+t, then r4- s--t 4 0. 
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In connection with this result, Raclis considered in 1944 the fol- 
lowing conjecture, where p is an odd prime: 


(R’,) If a,b,c are nonzero integers and pabc divides a? + bP? + c? 
thena+b+c=0 or a? +b +P — 0. 


Raclis showed: 


(1E) Let p be an odd prime. If (Rj) is true then the first case of 
Fermat’s theorem is true for the exponent p. 


PROOF. Assume that x,y,z are nonzero pairwise relatively prime 
integers, not multiples of p, such that x? + y? + z? = 0. Let r,s,t 
be defined as in relations (1.1), so r,s,t are nonzero integers. We 
remarked after (1.1) that r? + s? + t? £0, and by (ID), r+ s+ t £ 
0. By (1C), p divides r+ s + t. By (1.1) and (1.3), r,s,t divide 
r? + s? +t”, and since p, r, s,t are pairwise relatively prime then prst 
divides r? + s? + t”, showing that (Rj) is not true. 


The validity of (Rj) is very questionable. (R5) and (Rs) are false: 
p=3,a=b=c=1 and p = 5, a = 33, b = —2, c = —1 provide 
counterexamples. 

Similarly, consider the following statement: 


(R7) Ifa,b,c are nonzero integers and pabc divides a? + b? 4- pr tcp 
(for some n > 2) then a + b+ p"c = 0 or a? + b? + pmte = 0. 


Then: 


(1F) Let p be an odd prime. If (RY) is true then the second case 
of Fermat’s theorem is true for the exponent p. 


PROOF. Assume that x,y,z are pairwise relatively prime integers, 
such that p | z and x? + y? +2? = 0. Let r,s,t, and n be defined as in 
relations (1.2), so r,s,t are nonzero integers, n > 2. Then by (1D), 
r+s+p"t #0, and by the remark after (1.2), rP + 5? + pmte £0. 

However, as already shown in (1C), r+s = 0 (mod p), so p divides 
rP? + s? + pl, By (1.4), r, s, t divide r? + s? + p?"^!t? and since 
p,r,s,t are pairwise relatively prime then prst divides r? + sP + 
DT. This contradicts the assumption (R7). 
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In 1946, Inkeri generalized (1A). He also proved the corresponding 
result for the second case, with a more powerful method from Class 
Field Theory. 


(1G) [fp is an odd prime, n > 1, if x,y,z are nonzero relatively 
prime integers such that x?” +y?" +2?" — 0, and if p does not divide 
x then zx? ! =1 (mod p?"*!). 


PROOF. Since x,y, z are pairwise relatively prime, we may assume, 
for example, that p Ÿ y.We have 


a Lg" + 2?" = (yH z) Il Qp Ee 2) : 


m=1 


By Chapter II, (3C)(2), the factors on the right are pairwise rela- 
tively prime, so y+ z= a?" where a | x, p fa. Similarly, x + z = b" 
where b | y, p yb. 

If p Vz, we have similarly x + y = c" where c | z, p Yc and (a? + 
yP)/ (x + y) = d" where d | z, p / d and gcd(c, d) = 1. However, if 
p | z, by Chapter II, (3C)(3), x+y = pe, (x? + y^)/(x +y) = pd?” 
where cl z, d| z, p Yc, p fd, gcd(c, d) 2 1, h » 1. 

We show that if q is any prime dividing d (whether p | z or p yz) 
then q = 1 (mod p"*!). Indeed, since q | d then q | x^ -y^, q | z, qz 
p, and q Ke, hence q Ÿ x + y. So 


y a" (mod q), 
x =b" (mod q), 


hence q Va?” +b”. But op" +b?" = x? +y? =0 (mod q). There- 
fore q is a primitive factor of the binomial a?" ^ + MT". By Chapter 
II, (3G), q= 1 (mod p"*?). It follows that d= 1 (mod p"*'), hence 
d" =1 (mod p?"*?). 

To conclude the proof, we examine separately the two cases. 

If p Yx then (x? +y?)/(a+y) = 1 (mod p?"*!) hence z?--y? = x+y 
(mod p?"*1). By symmetry, y + 2? = y + z (mod p?”*") and x? + 
zP = x +z (mod p?"*!). Adding up these congruences, dividing by 
2, and subtracting the second one, we obtain x? = x (mod p?"*!), 
hence x?! = 1 (mod p?"*1). 

If p | z, let v(z) = k > 1, so v (27) = pk > n+1. By 
Chapter IL, (3C)(4), v,(x + y) = p"k — n, so x = —y (mod nr ^"). 
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A x? P 


= pz?! (mod penis 


But kp" — n > 2n +2, because p > 3 (otherwise the hypothesis is 
not satisfied); hence z^! = d?” = 1 (mod p?"*?). 


'The next group of results will involve congruences of the type 
(12-2) 21-4 a (mod p**!). 


It is convenient to precede the discussion with the following easy 
result, which is given explicitly by Ferentinou-Nicolacopoulou (1965): 


LEMMA 1.1. Let p be an odd prime not dividing the integer a. 
(1) ff n >m 20 then a? Za" " (mod mit, 
(2) If k 2 1 then a” — op = ap*-1(a?-! — 1) (mod p**!). 
If p also does not divide a + 1: 
(3) If k >1 then [(a 4- 1)" — a” — 1] — [(a + 1)?" — a 
= p*-![(a +1)? — a? — 1] (mod p**?). 
(4) If k > 2 then (a 4- Un = a?" +1 (mod p**!) if and only if 
(a 4-1)? =a +1 (mod p**). 
(5) If k > 1 and (a+1) = a?" 1 (mod p**?) then (a+1)" | = 


k—1 


a?” +1 (mod p**?). 


—1 


— 1] 


PROOF. (1) a?” - = a? = a (mod p), hence raising to the 


power p" 7: 


a" zg " (mod p^"). 
(2) a” af =a? "Ia @-) — 1]. If a?! = 1 + bp then 
(an) - = 1 + bp (mod pet), 


hence 


(ang ` 1= bp" = (ar 1) p" (mod p**!). 


Since a?" = a (mod p) then 


k k—1 


a? o  za(a" —1)p" (mod p**^). 
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(3) By (2) and the hypothesis, p does not divide a + 1: 
(a-- 1" — (a+ 1)" = (a+ 1 ((a+ 1) —1). (mod sr 


and similarly 


Therefore, by subtraction, 


k k—1 k—1 


[(a +1)" — a” — 1] — [(a +1)" sar — 1] 


(4) First note that if k > 2 then a” =a?" ' =a? (mod p?) and 
also 
(a +1)" =(a+1}” =(a+1)? (mod p). 


Thus 
(a +1)" — a” Ce 
= (a+1}? —a’—1 (mod p?) 
If 
(a+1) —a? —1=0 (mod p**!) 
then also 


The converse is proved in the same way. 
(5) By hypothesis, (a + 1)? = a? +1 (mod p**!) hence also 
(a+ 1) =a +1 (mod pt"). 


In particular, as was noted by Birkhoff and was published by 
Carmichael in his second note of 1913, if p does not divide a, nor 
a+1, then (1--a)? = 1+a” (mod p?) if and only if (1+a)” = 1+a” 
(mod p?). 

The following result was given by Klósgen in 1970: 


(1H) Let p be an odd prime, and m > 1. The following conditions 
are equivalent: 
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(1) There exist integers x,y,z, not multiples of p, satisfying the 
congruence 


zr?" +y +2 =0 (mod pt). 
(2) There exists an integer a, 1 € a € (p — 3)/2, such that 
1+a?" z(1-a)" (mod p"*!). 


Moreover, if any two of the numbers x,y,z are congruent modulo p 
then 28 1 =1 (mod p™*?), 


PROOF. We need only to show that (1) implies (2), since the other 
implication is trivial. Let z' be an integer such that z’z = 1 (mod p). 
Let a = z'z (mod p), —b = z'y (mod p), where 1 <a <p—1,1< 
b<p—1. Then 


aP” = 2P" pi (mod p"), RT = zP" P" (mod p"), 


so aP” +1= 0?" (mod p"*1). If 6b& a +t (mod p) then a?” +1 
br = (a +t)” =a+t (mod p) so t = 1 (mod p), and a?” +1 
(a--1)" (mod p™+!), with 1 «a «ac 1X p- 1. 

If a= (p — 1)/2 then a+ 1 = (p + 1)/2 = —(p — 1)/2 (mod p) so 


(p-1)" +2?" 2—(p—-1)" (mod p"*!) 


hence 2?" = 2 (mod p"'*!) and we take a = 1, since 1?" +1 = (1 + 
1)?" (mod p"*!). If (p —1)/2 < a € p-2 then taking a, = p-1—a 
we have 1 € a, € (p — 3)/2 and 


14a? =(1+a}" (mod p"). 


For the last assertion, we may assume, for example, that x = y 
(mod p), the other cases being similar. In the course of the proof, we 
had a = z'z = z'y = —b (mod p), b=a+1 (mod p) hence 2a = -1 
(mod p), so a = (p — 1)/2. Therefore 2?"-! = 1 (mod p”+#1). 


An immediate corollary of this result is the following. 

If the first case of Fermat’s theorem fails for the exponent p, 
that is, if there exist integers x,y,z, not multiples of p, such that 
x? + y + z? = 0, then we also have x? + y + z? = 0 (mod p°). 
There exists a, 1 € a € (p—3)/2, such that 1+ a? = (1 + a)? 
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(mod p?). Carmichael (1913) and Meissner (1914) obtained the fol- 
lowing more precise results (the statement for the first case is also 
given by Gandhi, in 1975). 


(11) Let x,y,z be nonzero relatively prime integers such that x? + 
y+ 2=0. 

(1) If p Y xyz there exists an integer a, 1 < a < (p —3)/2 such 
that 


(1+a)?=1+a” (mod p?), 


or equivalently, (1 + a)? = 1 + a? (mod p°). 
(2) If p | xyz there exists an integer a, 1 € a € (p — 3)/2, such 
that (1 + a)? = 1 + a? (mod p°). 


PROOF. (1) By (1A), z? = x (mod p?) so 2?” = a? = x (mod p?). 
Similarly y” = y (mod p?) and 2° = z (mod p?). Hence, by (1C), 
xP Lu Lat =xt+y+z=0 (mod p?). 

By (1H) there exists a, 1 < a < (p — 3)/2, such that (1+ a)” = 
1+ a” (mod p?) and by Birkhoff's remark following Lemma 1.1, 
(14- a)? =1+a? (mod p?). 

(2 If p | xyz, we may assume, for example, that p / xy, p | 
z. According to (1C), x + y +z = 0 (mod p°). Then (x + y)? = 
—z? = xP + y?" (mod p?). Since x # 0 (mod p), y # 0 (mod p), 
there exists an integer b, 1 < b < p — 1 such that y = bx (mod p). 
Then y? = bz? (mod p?) and (x+y)? = x?(1+ 6)? (mod p?). Thus 
x?(1 +b)? = x?(1 +b”) (mod p°). Since p Yx then (1 +b)? = 1-- b? 
(mod p?). 

As in (1H), if 1 € b € (p—3)/2 we take a = b, if (p— 1)/2 < 
b < p— 1 we take a = p — 1 — b and if b = (p — 1)/2 then we take 
a — 1. 


RTS 


We remark that the above criterion for the first case is useless 
when p = 1 (mod 6), as pointed out by Birkhoff (in Carmichael’s 
second note, 1913). For this purpose, we establish the following easy 
lemma: 


LEMMA 1.2. Let p be an odd prime. Then p = 1 (mod 6) if and 
only if p Æ 3 and there exists an integer t, 1 < t € p — 1, such that 
t?+t+1=0 (mod p). 
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PROOF. Let p = 6k +1, let g be a primitive root modulo p. Let t 
be such that 1 < t € p — 1 and t = g?* (mod p). Sot #1 (mod p), 
but t? = 1 (mod p). Hence P? -t-- 1 = (t? —1)/(t—1) 0 (mod p). 

Conversely, if t?--t--1 = 0 (mod p), then (2t-- 1)? = 4t?+4t+1= 
A(t?+t+1)—3 = —3 (mod p). So —3 is a square modulo p Z 3. Hence 
1 = (—3/p) = (p/3) so p = 1 (mod 3), hence p= 1 (mod 6). 


Now we use the following fact established by Cauchy in 1841: if 
p = 1 (mod 6) then the polynomial pX (X 4- 1) (X? 4- X +1)? divides 
(X +1)? — X? — 1 (see Chapter VII, (2A)). 

Thus, if p = 1 (mod 6), by the lemma there exists t, 1 < t < p— 1, 
such that t? +t+1 = 0 (mod p). It follows that (1 ++)? = 1 4 t? 
(mod p?) and by a remark of Birkhoff, (1 + t)? =1 +t” (mod p?). 
Proceeding as in the proof of (1H), there exists a, 1 < a € (p — 3)/2, 
such that (1 +a)” =1 +a” (mod p?). 

This establishes the assertion that if p = 1 (mod 6) then the cri- 
terion of (1I) is useless in the first case. 

Wagstaff verified in 1975 that for every prime p < 100000, p = —1 
mod 6, the congruence (1 + x)? = 1 + x” (mod p?) has no solution 
in integers a,1 < a € (p — 3)/2. In this way, it was proved that the 
first case is true for such exponents. 

In the same year of 1975, Gandhi had independently suggested 
that such computations be performed. (11) may be rephrased as 
follows: 


(1J) Let g be a primitive root modulo p. 
(1) If 14-gi*" +g"? £0 (mod p?) for all indices j,k = 1,... ,p—1 
then the first case of Fermat’s theorem holds for the exponent 
p. 
(2) If 14- g/? +g £0 (mod p?) for all indices j,k =1,...,p—1 
then Fermat’s theorem holds for the exponent p. 


Pnoor. (1) If the first case fails for the exponent p, there exists 
a,l € a € (p—3)/2, such that (1 + a)” = 1 + a” (mod pë). 
Note that a Æ 0, —1 (mod p). Let j,k be indices such that a = ol 
(mod p), -ü + a) = g* (mod p). Then a” = g” (mod p?), (1+ 
a)” = —g*” (mod p?) and therefore 1 + alt + g^? 2 0 (mod p°). 
(2) If Fermat’s theorem fails for the exponent p, there exists a, 1 < 
a € (p — 3)/2 such that (1 + a)? = 1 + a? (mod p°). We conclude 
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similarly that there exist indices j,k such that 1 + g/? + g^? = 0 
(mod p°). 


In 1950, Trypanis announced without proof the following strength- 
ening of Carmichael's result. It was rediscovered by Ferentinou- 
Nicolacopoulou in 1965 (we present her proof) and generalized by 
Klósgen in 1970 (see (1J)). 


(1K) Ifthe first case of Fermat's theorem fails for the exponent 
p > 5, there exists an integer a, 1 < a € (p — 5)/2, such that 


(1+a)” =1+a? (mod p’), 
or equivalently 


(+a) =1+a (mod p?). 


PROOF. Assume that z, y, z are positive integers, not multiples of p, 
such that 


z? + y? — zh. 
Let t be the order of p modulo z, so t is the smallest positive integer 
such that 

p=1 (mod zl, 
We write pt — 1 = dz, for some integer d > 0. Clearly p does not 
divide d Let m = dx,n = dy, hence p does not divide m,n and 


m? +n? = (pf UE, 


So m,n, —(p' — 1) satisfy Fermat’s equation. By (1C), m+n = p'—1 
(mod p?). 

Let us note that m,n are less than p' — 1 and since (m+n)? > 
m? +n? then m+n > p — 1, so p! « m4 n4- 1 < 2p‘. This implies 
that t > 4, because if t < 3 then 


m+n=p'—1 (mod p^, 


that is, m+n 4- 1 2 0 (mod p'), which is not possible. 
Since m+n = —1 (mod p?) then (n -- 1)? = —m? (mod p*). Also 
from m? + n? = (p' — 1)? we deduce that 


n?+1=—m? (mod TI), 


VI.1. Congruences 177 


therefore n? + 1 = —m? (mod př), and combining with the previous 
congruence, 

(n+1)?=n?+1 (mod pf). 
By (1A), m?! = n?! = (pt — 1)! =1 (mod p?) and fromn+1= 
—m (mod p?) we have 


(n4-1)7 zm?" !zm"z1 (mod p)). 


As is known, there are y(p*) = p?(p — 1) invertible residue classes 
modulo p?, and they form a multiplicative cyclic group. If the residue 
class of w modulo p? is any generator of this group, from 


nPz1 (mod p) and = (n+1)?'=1 (mod p?) 
it follows that the orders of the residue classes n mod p? and (n+1) 
mod p? divide p — 1; so there exist positive integers h, k, 0 < h, k € 


p — 2, such that : 
n=w"? (mod p?) 
and | 
n+1=w*? (mod p). 
Let b = w^ (mod p). Then 


nz =b (mod p) 


and 


k 


n+1=w'” =w* (mod p), 


so w^ =b+1 (mod p) and we deduce that 
n? =b (mod p) 
and : 
(n+1)?=(b+1)” (mod p?) 
Since (n + 1)? = n? +1 (mod pê) it follows that 
(b-- 1)" 2 b +1 (mod p?). 


Let us note that p does not divide b nor b+ 1. If 1 < b < (p—5)/2 
we take a = b. If b = (p — 3)/2, from 


Er) ERY +1 D 


we deduce that 
je" +2 (mod p^), 
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hence we may take a = 2. If b = (p—1)/2 then 


esty «y wan 


2 2 


hence 
zx 29 (mod p), 


so we take a = 1. If (p+1)/2 < b < p—1, let a = p — 1 — b, so 
1<a<(p—3)/2 and 


GHD? Sar =1-— (b+1)? =1+a” (mod pf). 


By Lemma 1.1(4), it follows that 


(a+1) =a” +1 (mod pî). 


Let us note that according to Lemma 1.1(5), (1K) is in fact a 
strengthening of (11). 

From these results, we obtain as an immediate corollary the one 
indicated by Gandhi (1976): 


(1L) [If the first case of Fermat’s theorem fails for the exponent p 
then there exists an integer b, not a multiple of p, such that (1+6)? = 
1+ H (mod pf). 


PROOF. By (1J) there exists a, not a multiple of p, such that 
(ia =1+a” (mod p?). 
By Lemma 1.1, 
(1+a)?=1+a? (mod p). 
Raising to the pth power: 
(1--a)" =(1+a?)’ (mod p’), 


so 
(1+a?)?=1+a” (mod p^). 
Letting b = a? then (1 +b)? =1+ b? (mod pf). 
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Bréié-Kostié proved in 1952 a result of the same kind, under some 
special conditions on the prime exponent p. Klósgen proved in 1970 
the following generalization of (1K): 


(1M) If n > 1, if x,y,z are integers, not multiples of the odd 
prime p, such that x?" + P^ + 2?" — 0, then 


a?" pyP" +2 20 (mod p", 
or equivalently, there exists an integer a, 1 < a < (p —3)/2, such 


that 
(+a) 21- a?" (mod sënn) 


PROOF. By (1G), x? = x (mod p?"*!). Raising to the power p" : 

a" = x?" (mod p?"*!). Again, x? sm = a?” (mod p?"*!) 
s e 3n n * d 

and repeating this procedure, r" = x? (mod p?"*!). Similarly, 


y" = yP” (mod p?"*!) and sp" = z"" (mod p?^*!). Hence 


oP" py" pz ma" Lu +2" se D (mod p". 


The last assertion was proved in (1H). 


Taking n — 1, we obtain (1J). 

Johnson investigated (in 1977) whether the congruences of Carmi- 
chael and Trypanis may be further strengthened, modulo every power 
p"? (n > 1). This is a typical situation to be handled by p-adic 
methods. ) 

Let a; € Z, denote the unique p-adic integer which is a (p — 
1)th root of unity and is such that a; = j (mod p), for every j = 
1,2,...,p— 1 (see Chapter V, (1W)). 


(1N) ` Let p be a prime, p > 3, and let a be an integer such that 
p does not divide a nor a +1. Then the following conditions are 
equivalent: 

(1) for every n > 1: (1+ a) =1 +a” (mod p"*?); 

(2) A+ A = O14a; and 

(3) à? +a+1=0 (mod p). 


PROOF. The p-adic development of a, is of the form a, = a+ pap, 
with p, € Zp and p, is uniquely defined by ag. We show that for 
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every n > 0 we have the congruence 
(1.5) Qa =a?” + p,p"*! (mod si". 


For n — 0 it is trivial. We assume that (1.5) is true for n. Then from 
a?-! = 1 it follows that 


Qa = (a)? = (Loft + pap) 


a E + pap” t?ar" (9-0 


aë + pap"? (mod p"*?), 


since of 0-9 = 1 (mod p). 

Now we show the equivalence of the statements (1), (2), (3). 

(1) — (2) For every n > 1,1+a° = (1+ a)" (mod p"*?). 
Hence by the above congruence (1.5): 


1 + Qa = 1 + a” + Pap" = (1 + a)" + pap (mod pt’). 


On the other hand, ou = (1 +a)?” + pizap"*! (mod p"*?). Hence 
1 + Qa = Qia (mod p"*!) for every n > 1. From the uniqueness of 
the p-adic development it follows that 1 + a, = Qi+a- 

(2) — (1) We have 


Qita Z (1 + a)" + Gen" (mod po) 
and 
1+a, =1+a" 4+ pap"t! (mod p"*?). 


By hypothesis, ou = 1+a,, so (1 + a) + p3,4p = 14+ (a + pap). 
By the uniqueness of the p-adic developments, Pia = pa, hence 
(1 +a)?" =1+a (mod p"*?). 

(2) — (3) The p-adic (p — 1)th roots of unity constitute a mul- 
tiplicative cyclic group. Let a; be a generator. We consider the 
subfield Q(a;) of Q,. It is a Galois extension of Q. Let o be the 
automorphism such that o(a;) = a;'. Since a, = (a;)* (for some 
exponent k) then o(a,) = (a;) * = a,!. Similarly o(a144) = Oo, 

From the hypothesis 1+ a, = 0444, we deduce by applying © that 
1+a;!=a;},. Hence 


AaQAi+ta + Cito = Qa; 


SO 
Qa 02 1c 04 = A 
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and 
o? +a,+1=0. 
Since a, =a (mod p), it follows that a? + a +1 = 0 (mod p). 

(3) — (2) Since p Z 3 and à? + a +1 = 0 (mod p) then a z 1 
(mod p). Multiplying with a — 1 we have a? = 1 (mod p). Then 
(aa)? = 1 (mod p). But (a)?! = 1, so a? = 1, which is the unique 
(p — 1)th root of unity in Z, congruent to 1 modulo p. Since o, Æ 1, 
it is a primitive cubic root of 1, and therefore 


o2 +a,+1=0. 


So aa +1 = —a? = —a? = a4 1 (mod p), and therefore o, + 1 = 
04.1, the unique (p — 1)th root of 1 in Z, which is congruent to a+ 1 
modulo p. 


Let us note that in view of Lemma 1.1, condition (1) above is 
equivalent to: 


(1) Foreveryn 2 1: (Lia =1+a"" (mod p^??). 


According to Lemma 1.2, if p > 3 there exists an integer a, 1 < 
a X p — 1, satisfying the equivalent conditions of (1N) if and only if 
p= 1 (mod 6). 

As a corollary, we have: 


(10) Ifp is a prime, p=5 (mod 6), then there exists an integer 
no > 0 such that if n > no and a = 1,2,...,p —2 then 


(1 a)" z1--a" (mod p"??). 

PROOF. By Lemma 1.2, if a = 1,2,...,p—2 then a +a+1 #0 
(mod p). By (1N), for every a there exists an index n(a) such that 
Ota 14a” (mod p"(9*2), 

By Lemma 1.1(5), if n > no = max{n(a) | a = 1,2,...,p — 2} then 


(1+a)" 41+?" (mod p"**?) 


for every a=1,2,...,p—1. 
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It is interesting to find out whether the following implication is 
true: 


(I) Ifa is an integer, a = 1,2,...,p —2, and (1 + a)? =1+a? 
(mod p?) then a? +a+1=0 (mod p). 


Indeed, if (I) is true then the first case of Fermat's theorem holds 
for every exponent p=5 (mod 6). Because, otherwise by (11) there 
exists a, 1 < a € p —2, such that (1+ a)? = 1-- a? (mod p?). Hence 
by (D), à? +a+1=0 (mod p) and therefore p = 1 (mod 6), contrary 
to the hypothesis. 

Arwin showed in 1920 that there exist integers a and primes p 
such that (1 + a)? = 1 + a? (mod p?) but a? +a+1#0 (mod p). 
So such a strengthening of the implication (I) is false. 
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VI2. Divisibility Conditions 


Let p be an odd prime and assume that z, y, z are nonzero relatively 
prime integers such that x? + y? + z? = 0. In this section we shall 
indicate some divisibility conditions which the integers x, y, z or some 
of their combinations must satisfy. 

The following proposition was given by Pérez-Cacho in 1958. How- 
ever, statement (2) had already been proved by Massoutié in 1931; 
a simpler proof was also given by Pomey in 1931. 


(2A) ` Let p be an odd prime number and assume that there exist 
nonzero pairwise relatively prime integers x,y,z such that x? + y? + 
2? = 0. 

(1) If 3 does not divide xyz then x = y = z Æ 0 (mod 3), the 
integers x? — yz, y? — xz, 2? — xy are divisible by 3, but not 
by 9, and if q is a prime, gz 3, dividing one of the numbers 
x? — yz, y? — zz, z? — vy, then q = 1 (mod 6). 

(2) If p = —1 (mod 6) then 3 divides xyz. 


PROOF. (1) Since 3 Y xyz then x,y,z are congruent to 1 or to 
—] modulo 3. From (+1)? + (+1)? + (£1)? = 0 (mod 3) the only 
possibility is that x = y = z Æ 0 (mod 3). Therefore x? = yz 
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(mod 3), thus 3 divides x? — yz, and similarly 3 divides y? — zz, z? — 
xy. 

We note that one, and only one, of the integers x,y,z is even. 
Thus £? Æ ys, y^ Æ zz, 2^ 3 xy. 

We show that 9 /z? — zy. We have z?? + xPy? +xP2? = 0 and zy = 
(zy — 22) + z? so —(z?? + x? 2?) = z"y? = Tou — 22) + 27]? = p(xy — 
22)22(-U + 22 (mod (2? — zy)?). Noting that x,y,z are distinct 
(since 2 is not a pth power), we have 


Qs(z?, 2?) = P + oP 2? +z? = p(z? — ry) (mod (z? — xy)?). 


Since 3 | x? — z?, by Chapter II, (3B)(6), we have v3(Q3(z?, z?)) 
= v3(3) = 1. Thus 9 Y2? — zy. 

Now let q be a prime dividing z? — zy, q # 3 (the argument is 
similar if q divides z? — yz or y? — zz). Then q / z (otherwise q | 
z and q | x or q | y, contrary to the hypothesis). Let z’ be such that 
22 =1 (mod q). Multiplying with z’?? we have (x?z/?)? + (x?2/?) + 
1 =0 (mod q). By Lemma 1.2, q = 1 (mod 6). 

(2) If3 /xyzand p Jz (the argument is similar when p Ÿ x or p Yy) 


then 
2p pap 
2 S ry 
PA 4 SS — 
Q,( , y) 22 RE zy 
2p p 4p 2p 
x HE e 
= E - " = p?) = p (mod z? — zy). 


Also z? — zy = z?? — xPy? (mod p). If p yY z? — xy then from the 
above congruence, p = 1 (mod z? — zy) so p = 1 (mod 3), by (1). If 
p | z? — zy, by (1), p & 1 (mod 3). 


Pomey claimed in 1931 to have shown that for any exponent p, 
if x,y,z are nonzero and x? + y? + z? = 0 then 3 divides xyz. He 
also claimed in 1934 that 5 divides xyz. However, his proofs were 
erroneous (see Brauer, 1934). 

Inkeri has proved in 1946 the following statement: 


(2B) [fp is an odd prime, p Z 1,9 (mod 20), if x,y, z are nonzero 
integers such that x? + y? + zP = 0 then 5 divides xyz. 


PROOF. By hypothesis, p Æ 5 and we may assume that x,y,z are 
relatively prime. 


186 VI. Arithmetic Restrictions 


If 5 /xyz then z?, y, z? are congruent modulo 5 to +1, +2. Chang- 
ing notation, if necessary, we may assume that x? = y or — y 
(mod 5). If we had x? = —y? (mod 5) then z? = —ax? — yP = 0 
(mod 5), contrary to the hypothesis. Therefore, x? = y? (mod 5). 

Let h be an integer such that ph = 1 (mod 4). If p= 3 (mod 4) 
we may take h = 3. Raising the above congruence to the power h, we 
deduce that x = y (mod 5). Since 2? = —ax? — y? = —2y? (mod 5) 
then again z?? = 4y? = —x?y? = (—xy) (mod 5) hence raising to 
the power h, z? = —xy (mod 5). In particular, since z = y = +1 or 
+2 (mod 5) then z? = +1 (mod 5). 

We examine now the integer z?? + x?y?, which is necessarily odd, 
since exactly one of the integers x, y, z is even. 

We show that if q is a prime, q Æ 5 and q divides z?? + xPy? then 
q = +1 (mod 5). Indeed, we have the relation (2x? + 3y?)? — 5y?? = 
A(x?P + 3gyPyP + y?) = A(z 2 + gPyP). So for such a prime q, we 
have the congruences (2x? + 3yP)? = 5y?? (mod q). So 5 is a square 
modulo q. By the quadratic reciprocity law, 


1= (2) = (2) thus g=+1 (mod 5). 


It follows that any factor k of z? + xy? which is not a multiple of 
5 must be congruent to +1 (mod 5). In particular, we take 


ates ay? 2(p—1) 2(p—2) 


= M MEE = 2(p—3) Se p-1 
k TS z z ry+z (xy) + (xy). 
Since 2? = —ay (mod 5) then k = pz?®-) = p (mod 5) because 
2? = +1 (mod 5). Thus k z 0 (mod 5) and therefore k = +1 


(mod 5). 

We conclude that p = +1 (mod 5). 

We have still to show that p = 1 (mod 4). If we assume that 
p = 3 (mod 4) then 3p = 1 (mod 4) hence z = z2? = (—2)%y*? = 2y 
(mod 5). Considering the relation (22? + 3y?)? — 5y?? = 4(z?P + 
Auf zf + y?) = 4(x° + yPz?) we deduce as before that every prime 
factor q #5 of z?? + y?z? must be congruent to +1 (mod 5). Hence 
every factor k of x” + y?z?, k #0 (mod 5), must be congruent to 
+1 (mod 5). 

In particular, taking k = x? + yz, if p = 3 (mod 4) we have 
y = —2z (mod 5) hence x? = y? = —2yz (mod 5) thus k = x°?+yz = 
—yz £0 (mod 5). Therefore k = +1 (mod 5). However, on the 
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other hand k = à? + yz = y? + 2y? = 3i? = +2 (mod 5), which is a 
contradiction. This shows that p = 1 (mod 4). 

Therefore, p = 1 or 9 (mod 20), contrary to the hypothesis, show- 
ing that 5 | xyz. 


Pérez-Cacho proved in 1958: 


(2C)  lfp is an odd prime number, if x,y,z are nonzero relatively 
prime integers such that x? + y? + z? = 0 then: 
(1) if q 45 is a prime factor of (x? — yz)(y? — zx)(z? — xy) then 
q = +1 (mod 10); and 
(2) none of the numbers x? — yz, y? — zx, z? — ry is a multiple 
of 25. 


Inkeri also proved in 1946 the following result: 


(2D) Let p be an odd prime number, and assume that x,y,z are 
relatively prime nonzero integers such that zf Lut +2?" = 0 (where 
n 21). Then: 
(1) 5 divides xyz(x — y)(x — z)(y — z). 
(2) 7 divides xyz(a — y)(x — z)(y — z)(a? — yz)(y? — xz)(z? — xy) 
(if p > 3). 
(3) 11 divides xyz(x — y)(x — z)(y — z) (3? +yz)(y? +xz)(2? + xy) 
(if p » 5). 


PROOF. First we note that if | is a prime and p / 1 — 1, if a,b are 
nonzero relatively prime integers then / | a+b if and only if { | a? +b’. 
Indeed, if a or b is a multiple of l, it is obvious. 


Let l Yab. 

It is clear that if l | a+b then l | a? + bP. Conversely, if l | a? + b? 
but l Ya +b then lis a primitive factor of a? +b” and by Chapter II, 
(3G), {= 1 (mod p), which is a contradiction. 

Let u = x? , v = y", w = z" , so we have u + v + w = 0 and we 
need to show (in view of the above remark, because p > 11): 

(1) 5 divides uvw(u — v)(u — w)(v — w). 
(2) 7 divides uvw(u—v)(u—w)(v—w)(u? —vw)(v? —uw)(u? —uv). 
(3) 11 divides 

uvw(u — v)(u — w)(v — w)(u? + vw)(v? + ww)(w? + wv). 
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Let l = 5,7, or 11. 
Since gcd(u,v,w) = 1, we may assume, for example, that l yv. 
Let v' be such that v'v = 1 (mod l). Hence multiplying with v’ we 


have t+t’ = 1 (mod l), where t = —v'u (mod l), t  —v'w (mod I). 
Let 

Tis. 

Ty =t-1, 

T, =t+1, 

Ti = 2—1, 

T, = t— 2, 

Ts = Ê -—t+1, 

T; — te; 

Ty = P —t-—1, 


Ty = ? —3t+1. 
Then 
UT, = —u (mod J), 
vI = —u—v=vw (mod J), 
vT; = —u- v (mod J), 
vl, = —2u — v = —u + w (mod J), 


vT; = —u — w = w — v (mod l), 


v?Ty = u? + vu +v’? = u? — vw 

= v — uw = w — vu (mod l), 
v^T, = u? — vu — v? = u? + vw (mod J), 
v^Ty = u? + vu — v? = —v? — uw (mod J), 


v^Ty = u? + 3vu +v? = w + uv (mod I). 


(1) Let | — 5, then it is easy to verify: if 


(mod 5) then 5] 7}, 
(mod 5) then 5] Ts, 
(mod 5) then 5] Ts, 
1 (mod 5) then 5 | 73, 
2 (mod 5) then 5| 7%, 
) 


ci ch cR ck oc 
II HL HE HE I 


hence 5 divides uvw(u — v)(u — w)(v — w). 
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(2) Let | = 7; then it is easy to verify: ift = —3, —2, —1, 0,1, 2, 3 
(mod 7) then 7 divides T4, Ts, T3, T1, To, T5, Te, respectively. Hence 
7 divides uvw(u — v)(u — w)(v — w)(u? — vw)(v? — uw)(u? — vu). 

(3) Leti = 11; then if t = —5, —4, —3, -2, —1, 0, 1, 2, 3, 4, 5 
(mod 11) then 11 divides T4, Tz, Tg, To, T3, Ti, T», Ts, Tz, Ts, To, re- 
spectively. So 11 divides uvw(u — v)(u — w)(v — w)(u? + vw)(v? + 
uw)(w? + uv). 


Let us note that the argument breaks down for l > 11. Indeed, in 
this case if t = —2 (mod l) then the values modulo l of T; (1 € i € 9) 
are distinct from 0 and have absolute value at most equal to 11. 

Concerning divisibility by 4, we indicate a result of Pierre (1943), 
preceded by a lemma on Jacobi symbols. 


LEMMA 2.1. Let a,b,c be pairwise relatively prime odd integers such 


(9-Q9-(9-« 


Then at most one of the numbers a,b,c is congruent to 3 modulo 4. 


Proor. Assume that a = b = 3 (mod 4). Then by the reciprocity 
law for Jacobi's symbol: 


[s E 
zG 

ke) 

Hence (a/b) = (a/c) = (b/c) = (b/a). However 


G-E- (9) 


which is a contradiction. 


1 


(2E) Let p be an odd prime number, and let x,y,z be nonzero 
relatively prime integers such that x? + yP + 2? =0. Then 4 divides 
one of the numbers x, y, z. 
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PROOF. We may assume z even, while x, y are odd; we also assume 
z = 2 (mod 4) to derive a contradiction. 


First Case: p xyz. 


By (2.1), 
p D 
gus zt +T 
z+ 
= SÉ Pg 4e SET za? ts 
= Aaf" +r”! (mod 4). 


—2(4k + 1)? + 1 = F2 +1 = 3 (mod 4), and therefore sı 
(mod 4). Similarly, r? = (z? + y)/(z + y) = 3 (mod 4), so ri 
(mod 4). 

By Chapter II, (3A), (r1/s1t1) = (s1/rıt1) = (t:/s171) = 1, hence 
by Lemma 2.1 at most one of the numbers r1, s1, tı may be congruent 
to 3 modulo 4, which is a contradiction. 


Since x is odd then x?~' = 1 (mod 4) hence s? = —2x?-? + 1 


Second Case: p | xyz. 

First we consider the case when p | z. As in the first case, sı = 3 
(mod 4), rı = 3 (mod 4). 

By Chapter III, (2A), the odd integers r1, s, pt; satisfy (r1/pt151) 
= (sı /ptırı) = (pti/risi) = 1. This contradicts Lemma 2.2. 

Now we assume that p Y z and, for example, p | x. Proceeding 
as before, by (1.2) we have pr? = 3 (mod 4) and since rı is odd 
then r77! = 1 (mod 4), so pr; = 3 (mod 4). Moreover, s, = 3 
(mod 4) and by Chapter III, (2A), the odd integers pri, s,, D satisfy 
(pri/siti) = (si/priti) = (ti/prisi) = +1, and this contradicts 
Lemma 2.1. 


In 1910, Lind claimed that 9 divides x + y + z, but his proof 
was insufficient. As a consequence there are several inequalities and 
equations in his paper which are questionable (see Dickson, 1920, p. 
769). 

After these divisibility results by small numbers, we turn our at- 
tention to divisibility results by expressions built from the numbers 
x,y,z which are hypothetical solutions of Fermat’s equation. 

In 1913, Niewiadomski showed: 
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(2F) Let p be an odd prime number, and let x,y,z be nonzero 
relatively prime integers, such that x? + y? + 2? — 0. Then z??*! + 
y?P*1 4 z?P*1 is divisible by (x + y)(y + z)(z + x). 


PROOF. We have 


pa uen ua c (SAY Ln NN +) p 
LP HYP YPZ? 2P4P 


as may be easily verified. Hence 


LPH yee 0 (yP PPa) (aP P (et a) 
(x +y)ly + z)(z +x) (y + z)(z + x) (z+ y)(z+2) 
(a? + y?)(y? +2”) 
(z+ y)(y + 2) 


and this number is an integer. 


The following result of Rameswar Rao (1969) is also very simple: 


(2G) fp is an odd prime number, if x,y,z are nonzero pairwise 
relatively prime integers such that x? +y? +2? = 0, then x+y divides 
d? where d = gcd(x +y,z) (similar statements hold by symmetry for 
rz-cz,y-z). 


PROOF. From x + y + z= 0 (mod p), there exists k # 0 such that 
z+y+z = kp. Since p is odd then x + y = kp — z divides both 
xP + yP = —z? and (kp)? — z? so kp — z divides (kp)?. Since d = 
gcd(kp, z), we may write kp = ud, z = vd with gcd(u,v) = 1. But 
u — v divides both u?d?^! and v?d?^! hence (u — v)d = x + y divides 
d. 


The following result, which appeared in a paper by Simmons 
(1966) was attributed to G. Reis; the assertion (1) was proved again 
by Rollero (1981): 


(2H) Let p be an odd prime number, and let x,y,z be pairwise 
relatively prime positive integers such that x? + y? = z?. Then there 
exist uniquely defined positive integers k,a,b such that x = k+a, y = 
k+b,z=k+a+t+b. 

Moreover: 
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(1) pab divides k?. 

(2) gcd(a, b) = 1. 

(3) If a Z 1 then gcd(k, a) Z 1; if p Va then gcd(k, a) Z a. 
(4) If b #1 then gcd(k,6) Z 1; if p Yb then gcd(k, b) Z b. 


PROOF. We may assume 0 < x < y < z. Since z < x + y, let k be 
defined by x + y = z + k. From y < z we have k < x < y. Let a,b 
be defined by x = k +a, y = k +b. Soz=k+a+b. 

It is clear that k, a,b are uniquely defined: if x = k’+a’, y = k'+b', 
and z = k'+a' +b' then 0 = (k— k')+ (a-a’) = (k— k') + (b — b') = 
(k — k')+ (a — a!) + (6-0) so k = k', a = a', b = b'. We have 


(2.1) (k +a)? + (k +b)? = (k + a +b). 


It is easily seen that gcd(k + a, k + b, k + a + b) = 1 because 
gcd(z, y, z) = 1. Therefore k+a, k+b, k+a+b are pairwise relatively 
prime. 

Since (k+a+b)? = (k+a)?+p(k+a)?—'b+---+0? then (k+b)? = 
p(k + a)?71b +... + 0? hence k? + pk?-"b + (iii. 
p(k + a)?-'b+-+++ 6? so 


kP = pbÂ(E + ay? Cl d IA a)? erc 


ta 1 o^ 
p—1 


Each bracketed expression is a multiple of a, thus k? is a multiple of 
pab. Ifa 4 1 then gcd(k, a) Z 1, if b # 1 then gcd(k, b) Z 1. We have 
gcd(a, b) = 1 for if a prime q divides a,b then it would also divide k, 
hence k +a, and k + b. 

If p Ja then gcd(k, a) Z a otherwise k = al, | an integer, and from 
(2.2) we have, after dividing by a, 


a? = pha" [(L-- 17! — P=] + B FED op 
Tee pb? 1, 


Hence a divides pb?^!. But p Noa hence a divides b?~', a contradic- 
tion. Similarly, if p /b then gcd(k, b) Z b. 
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Let p be an odd prime and assume that z, y, z are nonzero pairwise 
relatively prime integers such that x? + y? + z? = 0. 

Fleck (1909, 1910) began a more systematic study of divisibil- 
ity properties of the following numbers (built from the hypothetical 
solution (x, y, z)): 


A=ytyz+2’, 
B = 2t+zr+2', 
C = a^ + ry +y’, 


A, = x? — yz, A» = x? + yz, 

B, = y? — 2x, By = y? + zz, 

Ci = 2? — Ty, Co = 2? + TY, 
S=atytz, T = —-(xy +yz + zz), 
U = xyz, 2V = r? + y? + 22. 


Let r,s,t be defined as Barlow relations (Chapter III, 81). 


(21) With the above hypotheses and notations, there exist nonzero 
integers G, M, J, K, L, Ji, Kı, L4 such that 


(1) 


g= { —rstp’GM (in the first case), 


—rstp GM (in the second case). 


(2) A == GJ, B — GK,C — GL, A, — GJ,, Bi = GK, Cı = 
GI. 

(3) G is the greatest common divisor of S and the six expressions 
above. 

(4) J, K, L, Ji, Ki, Ly are pairwise relatively prime. 

(5) The prime factors of J, K, L are of the form 6hp +1. 

(6) The prime factors of Ji, Kı, Lı are of the form 6hp? +1. 

(7) £? = y? = 2°? (mod GJKLJ,K,L:). 


4 


In 1979, Inkeri gave corrections to some proofs of Fleck and ob- 
tained further results along the same lines. 

We conclude the section with a result of Pollaczek, obtained in 
1917. No elementary proof is known for it. Pollaczek’s proof was 
based on congruences obtained by Kummer in 1857, which should 
hold if the first case of Fermat’s theorem is assumed false for the 
exponent p. Kummer’s proof of these congruences involves a detailed 
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consideration of arithmetical properties of the cyclotomic field Q(¢,), 
where ¢, is a primitive pth root of 1. It is therefore not included in 
this book. 

Here is Pollaczek’s result: 


(2J) 


Let p be an odd prime number and assume that there exist 


pairwise relatively prime nonzero integers x,y,z, x? + y? +2? = 0. 
Then À = y? +yz+2?, B= z2 E zz 4 27, and C = x? + zy +y? are 
not divisible by p. 


1857 


1909 


1910 


1910 


1913 


1917 


1920 


1931 


1931 


1934 


1934 


1943 
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VI.3. Abel’s Conjecture 


Abel stated in 1823 that if x, y, z are nonzero relatively prime integers 
such that 0 < x < y < z and z^ + y" = z” (n > 2), then none of 
z, y,z are prime-powers. No direct proof of this statement has ever 
been discovered. However, we shall see that it is correct when n is 
not a prime number or when n = p is a prime not dividing xyz. 
'This last assertion will not be proved in this book, since it requires 
analytical methods. 

Many partial results, obtained by various authors, are summarized 
in Table 7. 

In 1887, Mansion claimed to have shown that if the exponent is 
an odd prime that x is not a prime. His proof was erroneous. In 
1891, Lucas published a proof that if n is arbitrary then x cannot 
be a prime-power; but his proof was incomplete, as pointed out by 
Markoff in 1895. In 1955, Moller established a theorem containing 
all the above results. However, the proof on page 27 of his paper 
was insufficient. We give below a simpler and correct proof: 


(3A) Letn 23 be an odd integer with r distinct prime factors. If 
1 € x < y are relatively prime integers and a = y” +x”, b= y” — x”, 
then: 


(1) a and b have at least r distinct prime factors; 


196 VI. Arithmetic Restrictions 


(2) If a has exactly r distinct prime factors then a = 2? + 1? (so 
ns: = 1); and 
(3) If b has exactly r distinct prime factors then y = x +1, b = 


(x 4 1)* — z^. 
'TABLE 7 
Year Author Exponent n7 2 Result 
1857 Talbot arbitrary (I) y, z are not primes 
(II) if x is a prime then 
z—y-l 
1884 Jonquières arbitrary (I) and (II) 
1887 Borletti odd prime z is not a prime 
even 7,y,z are not primes 
1901 Gambioli arbitrary (II) z not a prime-power 
not a power of 2 
1905 Sauer arbitrary y,z are not prime-power 
1932 Mileikowsky arbitrary y, z are not prime-power 
not a prime (III) x not a prime-power 
1949 Izvekoff arbitrary (1) 
1952 Bini odd z not a prime-power nor 
equal to nq, q prime, Z n 
odd prime z is not a multiple 
of n, z is not equal to 
q1d2::: qr where qı 
< qo <+: < qr are prime 
and q? > Zog: qr- 
PROOF. (1) Let pı,... , p, be the distinct prime factors of n. Since 


y +x # 0, by Chapter IL, (3B)(3), y” +x” = (y + z) - Q,(y, x) 
is a multiple of (y + x) []j_, Qr. (y, Fx). By the same result, Part 
(2), the integers Qp, (y, F£) (for i = 1,... ,r) are pairwise relatively 
prime. 

We observe that 2 + y” > x + y and (x? — y™)/(x — y) = 
LTI Py 4... yT > 1. So each Q,, (y, Fx) has at least one 
prime factor. Hence, a,b have at least r distinct prime factors. 

(2) Let m be the product of the r distinct prime factors of 
n, so m | n. First we show that n = m. We have a = (y™ + 
x™)(y” + z")/(y" + x). By (1) and the hypothesis, y" + z^ has 
exactly r distinct prime factors, which are the same as those of a. If 
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n > m then a has no primitive factors. By Chapter II, (3J), we have 
a = 2? + 1?, so n = 3 = m, which is a contradiction. Thus n = m, 
i.e., n is the product of r distinct primes. 

Let r=1,n=p,soa=(y+2)Q,(y, —). By hypothesis, a is 
the power of some prime q. Since q divides y + x and Q, (y, —x) > 1 
then by Chapter II, (3B)(4), q = p; by Part (6), Q,(y, —x) = p. By 
Chapter II, (3D), then p = 3, y 22, x = 1. 

Let r > 1, n = ph, p being a prime, p Yh; so h is the product of 
r — 1 distinct primes. From a = (y^ + zx")Q,(y^, —x^), by (1) and 
the hypothesis, y^ + x” has r — 1 or r distinct prime factors. By 
induction on r, if y^ + x" has exactly r — 1 distinct prime factors 
then h = 3, y = 2, x = 1. So n = 3p (with p > 3), r = 2, and 
2P + 1 = (2? + 1)Q3(2?, —1). 

If there exists a prime q Z 3, such that q | 2P +1 then by Chapter 
IL, (3B)(6), v,(Q3(27, —1)) = v,(3) = 0, so q  Q3(2?,—1). Since 
r = 2 and 3 | 2? +1 then Q3(2?,—1) is a power of 3; again by 
Chapter II, (3B)(6), Q3(2?, —1) = 3, so by Chapter II, (3D), 2? = 2, 
which is impossible. 

If 2? + 1 = 3* then 3° = 1 (mod 8), because p > 3. Thus s = 2s'. 
So 2? = 35 EEN — 1) hence 3%” + 9*3 — 1 = 2° 
with p—c > c > 0. Taking the difference 2 = 2?~°—2° = 2°(2?-*°—1), 
so c = 1, p — 2c = 0, i.e. p = 2, which is impossible. 

If y^ + z^ has exactly r distinct prime factors, then each prime 
factor q of Q,(y^, —x^) divides y^ + x^. By Chapter II, (3B)(4), 
then q = p; Q, (y^, —c^) > 1, therefore it is a power of p; by Part (6) 
of the same proposition, Q,(y",—x") = p and by Chapter II, (3D), 
p = 3, y? = 2, x” = 1, so h = 1, n = 3, concluding the proof. 

(3) If b = y" —z" has exactly r distinct prime factors, by Chapter 
II, (3B)(2) and (3), y" —z" is a multiple of (y— x) [l- Qr, (y, zx) and 
the integers Q,, (y, x) > 1 are pairwise relatively prime, hence by the 
hypothesis, they are prime powers. If y — x » 1 and q is a prime 
dividing y — x then there exists i such that q | Qp, (y, x). By Part (4) 
of the same proposition, q = p; and by Part (6), Qp, (y, x) = pi. By 
Chapter II, (3D), p; = 3, y = 2, x = —1, which is a contradiction. 
This proves that y = x + 1. 


As a corollary, we have: 


(3B) ` Let n 2 3 be a positive integer having r distinct odd prime 
factors. If 0 « x « y « z are relatively prime integers such that 
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x" + y" = z^". then z,y have at least r+1 distinct prime factors and 
x has at least r distinct prime factors. Moreover, if x has only r 
such factors then n is odd and z — y = 1. 


PROOF. By Chapter I, n is not a multiple of 3, nor a power of 2, so 
r 2 1. Let n= 2"m, with u > 0 and m odd, having r distinct prime 
factors. Let 2; = 27,91 = y^, 21 = 27. Since z^ = 2? = gY +y” 
and y” = yf" = 27 — y} and since m Æ 3 and z, — x, > 1, it follows 
from (3A) that z", y", hence also z,y, have at least r + 1 distinct 
prime factors. 

Similarly zx" = x" = 27% — y}, so x”, hence also x, has at least 
r distinct prime factors. If x has only r such factors then by (3A), 
z1 = y1 +1, ie., zi — y" +1. Sou 20,2 = y + 1 and n odd. 


More explicitly: 


(3C) Letn » 2, and let 0 < x < y « z be relatively prime integers 
such that x" + y^ = z^. Then: 


(1) y, z are not prime powers; and 
(2) if x is a prime power then z = y +1 and n is an odd prime. 


PROOF. (1) If z or y is a prime power, so is z" = x" + y", or 
y" = z" — x". By (3B), n is a power of 2, n > 4, and this contradicts 
Fermat’s theorem, which is true for such exponents. 

(2) If a is a power of a prime q then by (3B), z = y + 1 and 
n = pê, e > 1, p an odd prime. We show that e = 1. 

Ife>1 then since 2?—y? > 1 and Qpe- 1(2P, yP) = (2? — y”) /(2?— 
y?) > 1, it follows from x? = 2?° — y” = (z? — yP) - Qye-1(2?, yP) 
that bots factors in the right-hand side are powers of q, greater 
than 1, hence multiples of g. By Chapter II, (3B)(4), q divides 
ged(z? — y”, Qpe-1(2”, y?)) = ged(p* !, 2? — y"), hence q = p. 

On the other hand, since z = y+ 1 then z? — y? = py?! + 
(Dy? +- (vy + 1, so p = q does not divide z? — y”, which is 
a contradiction. 


Moller proved the following complement to (3A): 


(3D) Ifm is odd, m > 3, with r distinct prime factors, if0 < x < y 
are relatively prime integers, then b = y?" — x?" has at least 2r +1 
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PROOF. b = (y" — z")(y" + x") and since ged(x, y) = 1 then d = 
gcd(y" — z^, y" + x) = 1 or 2. By (3A), y" — x" and y" + x" 
have at least r distinct prime factors. Since m > 3, by (3A), y" +e” 
has at least r + 1 distinct prime factors. 

If y" — z^ has exactly r distinct prime factors, by (3A), y = x+1, 
thus y” — x" is odd. Hence d = 1 and b has at least 2r + 1 distinct 
prime factors. If y” — x” has at least r + 1 distinct prime factors, 
since d = 1 or 2, then b has at least 2(r + 1) — 1 = 2r + 1 distinct 
prime factors. [] 


In particular, if m is odd, m > 3 with r distinct prime factors, if 
0 < x « y < z arerelatively prime integers such that z?" +y?” = 22” 
then x, y have at least 2r +1 distinct prime factors. This was shown 
by Moller in 1955. Combining with (3B) it follows, with the above 
exponent 2m, that x,y,z cannot be prime powers. 

Inkeri showed in 1946 that if0 < x < y < z, if pis a prime number, 
p y zyz and x? + y? = 2”, then z — y > 1. Hence by (3C), x is not 
a prime power. This provided a direct proof of Abel’s conjecture in 
the case when p / xyz. No direct proof is known, when p | xyz, that 
x is not a prime power, so no direct proof that z > y+ 1 has been 
devised. 

In 1886, Catalan examined the implications of this eventuality. 


(3E) Let p be an odd prime number, and let 0 < x « y be integers 
such that x? + y? = (y 4- 1)?. Then: 

(1) py(y + 1) divides x? — 1. 

(2) pz, pz -1. 

(3) If q is a prime dividing y + 1 — x then q divides x — 1. 

(4) ged(z +y, y - 1 — x) — 1. 

(5) ged(2x — 1, 2y + 1) = 1. 

(6) x is the only integer such that 


(py)? < x < (p(y- 197)". 


PROOF. (1) The polynomial (Y +1)? —Y? — 1 is a multiple of p, Y, 
and Y +1, hence z? — 1 = (y--1)? — y? — 1 = py(y-- 1)h, h an integer. 
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(2) Since p | x? — 1 then p Yx. From z? = x (mod p) it follows 
that p | x — 1. 

(3) yP = (y+ 1)? — x = (y +1 — x)k, where k is an integer; 
hence if q| y--1— xz then q | y, sog | x — 1. 

(4) Ifq is a prime dividing x + y and y +1 — z then q| z—1 
so q | y, hence q | x; but from x? + y = (y + 1)? it follows that 
gcd(z, y) = 1, a contradiction. 

(5) Ifq is a prime dividing 2x — 1, 2y + 1, then it divides their 
sum 2(r + y) and their difference 2(y — x + 1); but q is odd, so 
gly—-æ+1,q|x+1, which is a contradiction. 

(6) We have 


(y +1)? -= y?” 
(y+1)-y 
S A y pe Dy by" ^. 


e e cs 


hence 
pes z” < p(y + Ps 
and this yields the inequalities of the statement. 
Finally, we note that if x1, £2 are integers such that 


d E 
(pur tr < z4 < z3 < (p(y - 1)*71) /? 
then 
1/(p—1) 1/(p—1) 
x x 
y«am (=) and To (=) <y+l, 
hence 
1/(p—1) 1/(p—1) 1/(p—1) 
x £ x 
(£2 — 21) (=) < X (=) — q41 (=) < 1. 
p p p 


But 1 < z — zı and p = (p^)? < (pye-1)/? < a, since by (2), 
p<æ—1<x< y. Therefore 1 < (2/p)/®-) and so 1 < (x2 — 
24) (24 /p)!/ ?-9, which is a contradiction. 


We now shall use Barlow's relations of Chapter III, 81, with an 
obvious change of notation, since we consider the relation z?-FyP = zf 
(with 0 < x < y < z) instead of z? + y? + z? = 0. In particular, we 
use the integers r, s,t defined in those formulas. 

In 1964, Dittmann proved the following facts: 
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(3F) ` Let p be an odd prime, and let 0 < x < y be integers such 
that x? + yP = (y + 1)? (hence x,y are relatively prime). Then: 


(1) If p| y - then v,(—s + 1) 2 vy +1) — 1. 
(2) If p| y then v,(t — 1) = v,(y) — 1. 


PROOF. (1) Let z = y+1 and z —z = —s?, so s Z —1. By Chapter 
III, (1C), n = v,(y +1) > 2and p" divides z and z + y. So p” 
divides 2z — (x +y) = z — x +1 = —s? +1. Since s? = s (mod p) 
then p | —s +1. Let v,(—s4- 1) 2 1 7 1. By Chapter II, (3B)(6), we 
have n € u(—s? +1) 21-1, 0| 2 n— 1. 

If v,(—s+1) > n then by the fact just quoted above, v,(—s?+1) > 
n + 1, hence p"*! divides —s? + 1 = 2z — (x + y). By (1.2), pP”! 
divides x + y, hence p"*! divides 2z, so p"*! | z = y + 1, contrary to 
the hypothesis. This shows that v,(—s + 1) 2 n — 1. 

(2) Let v,(y) = n, so by Chapter II, (3B), n > 2 and by (1.2), p" 
divides y and y+1—x = 2y—(x4-y—1). So p" divides x4-y-1 = t?—1. 
Since t? = t (mod p) then p divides t — 1. Let v,(t—1) — > 1; then 
by Chapter II, (3B)(6), n € v,(t? — 1) =141, hence l > n — 1. 

If v,(t — 1) > n then v(t? — 1) > n +1 so p"*! divides t? — 1 = 
2y — (y - 1 — x); but p?"^! divides z — x = y + 1 — x, therefore p"*! 
divides 2y, so p"*! | y, which is a contradiction. 


'The situation covered in the preceding result cannot yet be ruled 
out by a direct proof. With other methods, it may be shown that 
under the hypothesis of (3E), necessarily p divides y or y+ 1. 

In his thesis, Dittmann has also studied the possibility of a solution 
of Fermat’s equation, with y = x + 1. He showed: 


(3G)  lfp is an odd prime number, if there exist positive integers 
0 « z « 2 such that x? + (x + 1)? = sf, then: 

(1) p | x(x +1). 

(2) If p| x1 then —r = [-p^-V?s| andr & 1 (mod p). 

(3) If p | x then —s = [-p"-V/?r| + 1 and —s = 1 (mod p). 


PROOF. (1) If p z(z +1), by Chapter III, (1C), z — x = —s? and 
z — (x +1) = —r? so 1 = r?” — s”, which is impossible, since r,s are 
not zero. 

(2) Ifp|a+1 then p/ zz, so by Chapter III, (1), z — x = 
—pP"-isP,z — (x +1) = —r^; hence 1 = r? — pr lp, sor = 1 


202 


VI. Arithmetic Restrictions 


(mod p) and —r? < —p?^-1sP, so —r < —p™—1/Pg, 
If —r < —p^-/?s — 1 then r > p"1/Ps +1, hence r? > pls +1, 
which is a contradiction. This shows that —r = [-p"^- /Ps]. 


(3) 


If p | x then p (a+ 1)z, so by Chapter III, (1€), z — x = 


af, z — (x + 1) = —pP"-!r*; hence 1 = p?"^"!r? — s", so —s = 1 
(mod p) and —s? > —p?"-1rP, hence —s > —p^- Vr, 
If —s > —p^-VPr-EF1 then s+1 < p"-!/?r, hence s? +1 < p?"-1r?, 


which is a contradiction. This shows that —s = [—p"~!/?r] + 1. 
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VIA. The First Case for Even Exponents 


In this section, we give Terjanian’s proof (1977; see also 1978) that 
the first case of Fermat’s theorem holds for even exponents. It suffices 
to consider exponents 2p, where p is an odd prime. The proof will 
require only elementary considerations, so it is surprising that it was 
not found beforehand. 

Several authors had considered Fermat’s equation with even ex- 
ponents, however their direct proofs are now all superseded by Ter- 
janian’s; yet we shall quote, and in a few cases prove, some of these 
statements. 

To begin, in a letter to Gauss (1804), Sophie Germain stated with- 
out proof that if p is a prime, p = 7 (mod 8), then X^! + Y»! = 
Z?-! has no solution in nonzero integers. 

In the first of a long series of papers on Fermat's theorem, Kum- 
mer proved (1837) the following result, which was rediscovered by 
Niedermeier in 1943 and again by Griselle (1953) and Oeconomou 
(1956). We follow Griselle's proof: 


(4A) Letn > 2 be an integer. If there exist nonzero integers x, y, z 
such that x?" + y?" = 2?” and gcd(n, xyz) = 1, then n =1 (mod 8). 


PROOF. By Fermat’s theorem for the exponent 4, we may take n 
odd, n > 3. We may also assume that x,y,z are pairwise relatively 
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prime positive integers, and that x is even, while y, z are odd, by the 
remark at the beginning of Chapter I, §1. Then 


en = yen _ y" 


= (22 — y) (z2070 py- A p r2 2 4 yD) 
= (2 — y’) Qu.) 
with the notation of §1. By Chapter II, (3B)(4), 
ged(2? — y^, Oe, y°)) = ged(2? — y^, n). 


If p is a prime dividing n and 2? — y? then a fortiori p divides x, 
contrary to the hypothesis. Thus z?—y? and Q,(z°?, y?) are relatively 
prime. 

So z? — y? and Q,(z?, y?) are 2nth powers. Moreover, 2? — y? is 
even so Q,(z?, y?) is odd. Thus there exists an odd integer k such 
that 


z2- 0) a 22-2, + gina EE sëch = Lin 


Each term of the above equality is an odd square, so it is of the form 
(2a+1)? = 4a(a+1)+1=1 (mod 8). Thus, we have the congruence 
n = 1 (mod 8). 


From this result, it follows: 


(4B) The set of primes p such that the first case of Fermat's 
theorem holds for the exponent 2p is an infinite set. 


PROOF. According to Dirichlet’s theorem for primes in arithmetic 
progressions, there exist infinitely many primes p satisfying each of 
the congruences p = 3 (mod 8), p = 5 (mod 8), p = 7 (mod 8). For 
each such prime p the first case of Fermat's theorem holds for the 
exponent 2p, by virtue of (4A). 


We note the following strengthening of (4A); statement (1) is due 
to Niedermeier (1944), while (2) was given by Grey (1954): 


(4C) ` Let p be an odd prime and assume that there exist nonzero 
relatively prime integers x,y,z such that x?? + y”? = z?P : 


(1) If 3p does not divide x,y,z then p=1 (mod 3). 


VI.4. The First Case for Even Exponents 205 


(2) If 2p does not divide x,y,z then p = 24a + 1 (for some inte- 
ger a) and 12a + 1 has no factor congruent to 3 modulo 4. 


Another improvement over (4A) is the following ((1) was proved 
by Niedermeier in 1944, while (2) was given by Long in 1960): 


(4D) Let m 2 3 be an integer and assume that there exist nonzero 
integers x,y,z such that zm + y?" = z?m . 
(1) z m — p is a prime and if 5p does not divide x,y,z then 
= +1 (mod 5). 
(2) if gcd(m, xyz) = 1 then m = 1 or 49 (mod 120). 


As a corollary: 


(4E) If n = 2m has last digit 4 or 6 (when written in deci- 
mal notation), then there exist no nonzero integers x,y,z such that 
gcd(m, xyz) = 1 and x" + y^ = z^. 


PROOF. We have m = +2 (mod 5), hence m Æ 1 or 49 (mod 120) 
and the result follows from (4D). 


Long (1961) extended his method and proved: 


(AF) Ifp is a prime, p = —1 (mod 10) and if there exist relatively 
prime integers x,y,z, prime to p and such that x7? + y?? = z”, then 
p is a square modulo 11, and consequently p = 49, 169, 289, 529, or 
889 (mod 1320). 


Oeconomou proved in 1956 the following results, involving the 
Legendre symbol: 


(4G) Letn >1 be an odd integer and assume that there exists an 
odd prime q such that: 
(a) ged(g — 1,2) = 1. 
(b) (n/q) = —1. 
(c) If 0 € m < (q— 1)/2, with n =m (mod (q — 1)/2), if a,b,c 
are integers such that a?" +b?” = c?" (mod q), and 


2 2 DS r2 2 p2 
(=)= (¢ =) =1, (* =) =1, 
q q q 
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then abc = 0 (mod q). 


Then there exist no relatively prime nonzero integers x,y,z such that 
gcd(n, xyz) = 1 and x?" + y” = z”. 


There are numerous possible choices of q,m satisfying the condi- 
tions. For example: In any of the following cases, the first case of 
Fermat’s theorem holds for the exponent 2n: 

(a) n = —1 (mod 3). 
+2 (mod 5). 
Greg —] and n=4 (mod 5). 
(n/19) = —1 and n = 4 or 7 (mod 9). 
(e) (n/23) = —1 and n = 2, or 3, or 4, or 10 (mod 11). 
(f) (n/29) = —1 and n = 3, or 5, or 9, or 11 (mod 14) etc. .... 


In this way, Oeconomou proved that the first case of Fermat's 
theorem holds for all even exponents less than 200 000 (with the 
possible exceptions of 108 722 and 188 018). 

Gandhi proved in 1966: 


(4H) Let p be a prime, p > 5. If x,y,z are pairwise relatively 
prime integers such that x?! + y?-' = z?-! them z is odd, and p 
divides the one among the integers x,y which is even, hence p does 
not divide z. 


PROOF. Since p— 1 is even, by the remark at the beginning of Chap- 
ter I, 81, z is odd and x,y have different parity, say x is odd, y is 
even. 

If p / zy then z? ^! = y?! = 1 (mod p) while z^! = 0 (mod p) 
or = 1 (mod p). Thus 2 = z^! + y^! = 0 or 1 (mod p) which is 
impossible. Let us assume that p | x, p / y — this will lead to a 
contradiction. Let m — (p — 1)/2. We have 


x? 1 _ zP 1 yP b= (2™—y™)\(2z™ +y”). 


Then the two factors in the right-hand side are relatively prime, since 
they are both odd. Therefore each factor is a (p — 1)th power: 


zm EP. y" = a?” 1, 

z™ 4y” er, 
hence x = ab and also 22" = Of TT But p | x and gcd(a, b) = 1 
so p divides one and only one of the numbers a,b. Hence, 22” = 
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1 (mod p) and squaring this congruence, 4 = 1 (mod p), which is 
impossible. 
This proves that p divides y. Hence p does not divide z. 


In 1969, Raina proved: 


(4I) Let p be a prime, p > 5. If x,y,z are positive pairwise 
relatively prime integers such that x?! + yP-! = zP-! then z is a 
quadratic residue modulo p. 


PROOF. Only one of x, y is even, the other is odd. Suppose y is even. 
By (4H), z is odd and p divides y. 

Let m = (p— 1)/2. From (2™)? + (y")? = (z")?. it follows by 
Chapter I, (1A), that there exist positive integers a,b, of different 
parity, such that gcd(a, b) = 1 and 


gm = a? = b?, 
y" = 2ab, 
20 = a? +b. 


Suppose first that b is odd, so a is even. Since gcd(2a,b) = 1 then 
there exist integers A, k such that 


2a = h", 
oe 
Therefore 42™ = h?-1+4kP 1, If p| kthen p /hso4z™ =1 (mod p). 
Hence, squaring this congruence, 16 = 1 (mod p) so p = 5, which is 
not possible because the equation X4 + Y^ = Z^ has no solution in 
positive integers (Chapter I, (2C)). 
So p Yk and since p | y then p | k. Therefore h?~' = 4a? = 0 
(mod 16p) and z(?-9/? = k?-! = 1 (mod p). 
Suppose now that b is even, while a is odd. Proceeding in the 
same way, we show that z(?-9/? = 1 (mod p). Thus z is a quadratic 
residue modulo p. 


The next result in Raina’s paper is vacuous since its hypothesis is 
never satisfied (namely, z cannot be a prime, by (3C)). 

In 1955, Becker published a paper in which he asserted that Fer- 
mat's theorem is true for all even exponents 2m > 2. However, his 
proof is definitely wrong. 
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Now we shall give the proof of Terjanian’s theorem, which con- 
tains all the above results as corollaries. Once more, we shall consider 
the quotient Q,(z,y) = (z" — y")/(z — y) where n is an odd natu- 
ral number and z,y are nonzero distinct integers (not necessarily 
positive). 

For the convenience of the reader, we recall that if m, n are nonzero 
odd integers, n > 0, gcd(m, n) = 1, then the Jacobi symbol (m/n) 
is defined. If m > 0 then (n/m) is also defined, and the following 
reciprocity law is satisfied: 


(4.1) (2) = (—1)m-D/2x(-1/2 (2) 


Moreover, 


(4.2) (=) = (—1)-0/2, 


(4J) Let y,z be distinct nonzero odd integers such that y = 
(mod 4) and gcd(y, z) = 1. Let m,n be odd integers, m > 1,n 
1, ged(m,n) = 1. Then: 
(1) Qn(z,y) = m (mod 4), and in particular Q,(z, y) is odd. 
(2) The Jacobi symbols (Qm(z,y)/Qn(z,y)) and (m/n) are well 
defined and equal. 


IV x 


PROOF. (1) Let z 2 y+ 4t. Then 


(y + 4t)* — y" m - m ab 
ra a e m m-24t.L... 
Quiz) a 18 + 9 JY + 


= my"-!z m (mod 4), 


because m — 1 is even, y is odd, so y^^! = 1 (mod 4). 

(2) First we note that since gcd(m,n) = 1 then the Jacobi 
symbol (m/n) is well defined. Similarly, from ged(y,z) = 1 and 
Chapter II, (3B)(2) it follows that gcd(Q,,(z, y), Qn(z,y)) = 1; since 
Q,(z,y) > 0 then the Jacobi symbol (Qm(z,y)/Qn(z, y)) is also well 
defined. 

'The equality of the Jacobi symbols is proved by induction on k — 
min{m,n}. It is trivial when k = 1. Let k > 1, so m Z n, because 
gcd(m, n) — 1. 
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If m > n then there exist an odd integer r, 0 < r < n = k, and an 
integer q such that m = qn +r, or m = qn — r. If m = qn +r then 
m — r is even, so by Chapter II, (3B)(1), 


(ae) " EEE) " (Se) = (2) - (=) 
Qn(z,y) GET Qn(z,y) n nj 
If m — qn — r then m — n and n — r are even, so by Chapter II, 
(3B)(1), induction and the properties of the Jacobi symbol, 


(8) - Cea ges) 


= (en) = (en) Gen) 
E Esc DI 
Since Q,(z, y) = n (mod 4) by Part (1), then (Q,{(z,y) — 1)/2 = 
(n — 1)/2 (mod 2), hence by (5.2), (—1/Qn(z, y)) = (—1/n). Thus 


k CHE 


Now, if m « n, by the reciprocity law (4.1) for the Jacobi symbol 
and the above proof, 


(GE) era (E) 


= (—1)-D/2x-1/2 (=) _ (=). 
m TU 


Terjanian’s result now follows very easily: 


(4K) ` Let p be an odd prime. If x,y,z are nonzero integers such 
that x? + y? = z?? then 2p divides x or y. 


PROOF. There is no loss of generality to assume that x, y, z are pair- 
wise relatively prime. Also, x,y cannot be both odd, since the expo- 
nent 2p is even (see remark at the beginning of Chapter I, §1). Let 
x be even, so y, z are odd. Then 


x?P = zP =? y? = (2 EI y^) 
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By Chapter II, (3B)(4), 


2p 2p 
242 —V 


ged (y )=p or 1. 


? ES Le y? 
If the greatest common divisor is p then p divides z?? so 2p divides 
x. 

We show now that it is not possible that 2? — y? and (z?? — 
y?)/(2? — y?) be relatively prime. If they are, they must be squares. 
But 


LP a zf — yP " zP + uf 
z? — y? z—y 2+Y 


= Q(z, y) x Qyz, —y) 


and these two factors are relatively prime because gcd(y, z) — 1 and 
y and z are odd. So Q,(z, y), Q,(z, —y) are also squares. Since p is 
not a square, there exists an odd prime q such that p is not a square 
modulo q. 

Assume first that z = y (mod 4). By (4.1), 


BOCH 
q Q«G.y)/ ' 
which is an absurdity, because Q,(z, y) is a square. If z Z y (mod 4) 
then z = —y (mod 4), hence again 


- (0-3) 


which is again an absurdity. This concludes the proof. 


In 1981, Rotkiewicz showed the following strengthening of Terja- 
nian's result: 


(AL) Let p be an odd prime. If x,y,z are positive integers such 
that x? + y” = z2”, then 8p? divides x or y. 


PROOF. We may assume that x,y,z are pairwise relatively prime 
and that x is even, while y, z are odd. Then by Terjanian’s theorem, 
2p divides x. 

Now we show that 8 divides x. We have gcd(z? — x?, z? +. x?) =1 
because gcd(x, z) = 1, x is even and z is odd. Since y?” = z”? — 5?» = 
(z? — xP)(z? + x?) then z? — x? = [(z? — x?)/(z—2)|(z — x) = a”, 
where a is an odd positive integer. Since p | x then p Yz so p /z — x. 
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By Chapter II, (3B)(5), ged ((z? — x?)/(z — x), z ^ x) = 1, hence 
(2? — z")/(z — x) = b?, with b an odd positive integer (because z 
is odd). Hence b? = (2b, + 1)? = 4b,(b, + 1) = 1 (mod 8) and 
zP-l-pzP ?y.-pzP 33? = 1 (mod 8), because x is even. But z being 
odd, we have again z?~' = 1 (mod 8), so 2?-*x(z+x) =0 (mod 8). 
Hence y = 0 (mod 8), because z, z + x are odd. 

The proof that p? divides x follows from the result of Vandiver, 
(1B). We have also z? + x? = CT, where c is an odd positive in- 
teger. By Vandiver’s theorem we have x? = x (mod p?). Since 
p | x and p> 3 then p? divides x, showing that 8p? divides x. 


In 1950, Gut adapted ideas of Kummer and Mirimanoff and used 
methods of class field theory to derive a criterion, involving Euler 
numbers, for the first case of Fermat’s theorem with exponent 2p. 
Of course, this result is now obsolete. 

In conclusion we mention explicitly the following easy fact, which 
will be used later: 


(4M) If x,y,z are nonzero relatively prime integer, n > 3 and 
gen + y?" = 2%, then 2| zy and 3 | zy. 


PROOF. If x,y are odd then x?" + y?" = 2 (mod 4); now z must be 
even and z?" = 0 (mod 4), which is absurd. If 3 / zy then z?" = 
y?" = 1 (mod 3) but z?" = 0 or 1 (mod 3), so xz?" +y” = z?" is not 
true. 
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VI 
Interludes 7 and 8 


This chapter deals with polynomials which are intimately related 
with Fermat’s equation. 


VIL1. Some Relevant Polynomial Identities 


We give here some algebraic identities which are applicable in the 
study of Fermat’s equation. Many of the early attempts to prove 
Fermat’s theorem were based on some polynomial identities. 

To begin, we indicate the following identity, which was used by 
Lamé in 1840; see also Lebesgue (1847), Mention (1847), and Catalan 
(1885); Gauss (1863) gave the special cases when n = 3,5, 7; see also 
Rebout (1877), and Brocard (1878): 


(LA) If X, Y, Z are indeterminates and n is odd then 


(X4Y4Z)" —(X4Y—Z)" - (X-Y4Z) - (-X4Y4 En 
(n — 1)! 


x? y? gk 
(iF 1) 25 F 1)! 2k + 1)! 


= AnXYZ M; 


i+jtk=(n—3)/2 
Ljkz0 
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PROOF. By writing explicitly the nth powers of the left-hand side, 
we have: 
(Xc-Y-Z)-(X-c-Y-Zy)-(X-Y-«Z2)"-(-X«Y-«Z)* 
! 
= Y XYZ- (-1)° - c2) - oc. 
a! b! c! 


a+b+c=n 
a,b,c>0 


Since n is odd and a 4- b 4- c = n then one or three of the integers 
a, b, c are odd. If only one is odd then 1 — (-1)* — (C1)^ — (-1)* = 0. 
Thus, we have to consider only the summands with a = 2i + 1, b= 
2j7+1,c=2k+1, soi+j+k = (n — 3)/2 and 1 - (-1)* — (-1)^ 
(—1)* = 4. We deduce at once the identity of the statement. 


As a corollary, we have (see Werebrusow, 1908): 


(1B) If X,Y, Z are indeterminates and n is odd then 


4 
(RAVE Z =X Steg an (X+¥)(¥+Z)(Z+X) 


(n — 1! CX +Y)*(¥ + Z)9(Z +X) 
(i+ 1) (2j + 1)! (2k +1)! 


x 


itjtk=(n—3) /2 
i,j,k >20 


PROOF. We write the identity of (1A) with U,V,W in place of 
X,Y, Z, respectively, where U = (X + Y)/2, V = (Y + Z)/2,W = 
(Z + X)/2. Then 
U+V+W=X+Y+Z, 
ÜXYV-werY 
U-V4W =X, 
sy =z, 
hence (1A) becomes 
(X+Y+ZP-X"-Y"- mm 
4 
= x CE Y)  Z)Z + X) 


(n — 1)! (X -Y)*(Y +2) (Z 4- X?* 
(2i 4- 1)! (25 +1)! (2k +1)! | 


x 


i+jtk=(n—3)/2 
DEN 
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A special case is the following: 


(1C) If X,Y are indeterminates and n is odd then 


(Er ere 


4 
= (X+Y)XY 


(nt)! A 2i 2j y 2k 
s Or ED EE 


i+j+k=(n-3)/2 
DEEN 


PROOF. It suffices to replace Z by 0 in the identity (1B). O 


Already in 1837, Kummer used the following identity for (X 4- 
Y)" —(X"-- Y"), see also Mention (1847), Vachette (1861), Barisien 
(1906), Boutin and Gonzalez Quijano (1907), Bini (1907), Rose 
(1907), and Bachmann (1910): 


(1D) Jf X,Y are indeterminates and n > 1 then 


(X+YV)"—(X"+Y") = Dh C i EN A Y'(X VI" 


i=1 


(by convention, the terms in the above sum are zero when 2i > n). 


PROOF. We show by induction that the identity 


i nfn—-i- 
EEN m "Jensen x 
i=1 


is true. For n = 1,2 it is trivial. Then 


Xrti + yn = (xe 4 Y"(X + Y) = AY AS ques 
= (X yet 


in —i—1 e 

hal X!y? X Y n4-1—2i 
(Mc ; (EN M" | (X+Y) 
-xYü ye 


Se 17 - C | ?— P nyng 1—2i 
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= (X ANITA (1Y aX Y (X + YY) 


i=1 


where c; =n+1 and if i > 2, then 
|. n—i-—l1 Nik: n—i-—l |. n «1 n-—ài 
SE vases gp -2 | seo] d 


In 1885 (and again in 1886) Catalan indicated another form of the 
identity (1B): 


(1E) If X,Y,Z are indeterminates and n is odd, then 


(XY EZ) SIX Synt 


= Dn? + H P?-* H Ppr-> 
(X+Y)(Y +2)(Z+X) : i 


(2) 
+ $^ + Has + 2H, 3)/2 


where P= X +Y + Z = Hi; more generally, if à > 1, Hj is the sum 
of all monomials of degree i, coefficients 1, in the indeterminates 
X,Y,Z, and Hos is the sum of all monomials of degree (n — 
3)/2, coefficients 1, in X?,Y?, Z? (so it has degree n — 3), that is, 
Hs =| H(n-3)2(X°, Y?, Z^. 


PROOF. We have 


Dn un (X^ + yr + 257) Dn de Zr Xn + yr 


Qı = KAY aa E 
= (Po + ZP? + Z?pn-3..... p Zn) 
—(X"1-YX"?-Ly?x"» 3... yn, 


But, by Euclidean division, since Y + Z = P — X: 


Pn Zpe 4 Sift, + Zi 
Y +7 
= pr? + HN. ZP? + H(X, Zype* Se + Hy_o(X, Z) 
AA oie ce a a 
i Y+Z i 
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where H;(X, Z) is the sum of all monomials of degree i, coefficient 
1, in the indeterminates X, Z. Therefore 


_ & 
Q2 = YZ 
(P" + H (X, Z) P" + (AZ) PE + + + Hy 2X, 2) 


HS + Z)x"? = (Y? Pa Zug + (X3 + ZA 


+ 


X 


pets 
= (PP? + H(X, Z)P"^5 + H(X, Z)P"-* E BS SC Z)) 
-L(X"?-(Y-Z)Xx"?^-(Y'-ZY-2)Xx""*- 
ceu E si un 


By Euclidean division, 


P= + H(X, Z)P + H(X, mpeg RIT, SX Z) 
ZFX 
= P 34 A, Pp" *+ HP -o Hag 
"n "ENDE Z)Y® + DA Et EE EE 
Z+X 


where H. is as indicated in the statement. Then 
Q» 


Q3 = ZiX 
PP HiP + HP abated Hang 


1 
2 LE ? 4 H(X, ZY” + H((X,Z)Y"* 
++ Hy, o(X,Z) +X"? — (Y — Z) X" 


LIN ER -25x"*- 


H 


— (Y? — ZY" 4 77y"4_...-Z*)], 
But 
Hi(X,2) _, 
Z+xX ; 
It) 2 = X? + Z? = HP(x,zZ), 
BOE) _ ya, yop 4 74 = HX, 2), 


Z+xX 
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etc. Hence 


Qs = Pes uphes APRES ee H Hn aye 


+H? (X, Z)Y" + HY (X, ZY"7 + + HE Al, Z) 
1 
Y”? + H(X, Y EX Kee 
Vo yl zs »( H ) F al , ) ck 


E Ke (Y = Ee 
+(¥°-Y¥Z4+277)x**- 
éi ye E vm? + Z2yn-4 REPE Zi 


The expression in the brackets is equal to 


jac PYÝZX AY ZAY ec 

T(YZ)-YSZ)X" 8$ Tewes + YS)X T + 
HY Z YSZ +... H YZ Y?) 
+X- (Y —Z)X™ 34 (Y° -YZ + Z»)x"*- 
—(Y"? —Zy" IE Y Ea ERC 

exem RIX Da ames Ex 
HYHY? Z + ZAX H (Y ZHY? Z + ZX 
TEEN (Y*3Z HYZ? eet Z?) 

= X"? 4 ZX"? + HP (Y, EH + CN SE 
-Hf(Y, Sien + HP(Y, ZZX"7 +. + HG 


(Y, Z)Z. 


ae 3)/2 
Hence 
Q; = P" + HiP + BoP? +- + Hn- + ere. 
HY (X, Z)Y"5 + HP (X, Z)Y"7 e HO (X, Z) 
LX" 4 HOY, Z) X" HI Aalt, Z) 


= r3. HQPU + HP" ++ Hn- 42H np, 


since each of the above last two lines is equal to H? c 3/2: 
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VIL2. The Cauchy Polynomials 


In his proof of Fermat’s theorem for the exponent 7, Lamé (1839, 
1840) made use of a polynomial identity of degree 7. In the analy- 
sis of Lamé’s paper, Cauchy and Liouville indicated a more general 
polynomial identity (1839), of which the following ones are special 
cases (Cauchy, 1841): 


(X+Y)?—X°-Y° = 5XY(X - Y)(X? + XY + Y?), 
(X+Y) -X'-Y! = 7TXY(X - KIC + XY + Y?y. 


In this way the study of the polynomial (X + Y)" — X" — Y” was 
initiated. 

If n > 3, n odd, then this polynomial is a multiple of X, Y, X +Y; 
moreover, if n = p is an odd prime, then it is also a multiple of p. 


(2A) Letn = +1 (mod 6). The exact power of X? + XY + Y? 
dividing (X +Y)” — (X^ +Y”) has exponent 


|. f 1 when mz —1 (mod 6), 
"712 when n=1 (mod 6). 


PROOF. We shall show that (X + 1)" — (X^ +1) = (X? + X + 
1)* H,, (X) where eis as indicated, H,(X) € Z[X] and X°?+X +1 does 
not divide H,(X). By homogenization, we deduce the statement 
(2A). 
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Let G,(X) = (X +1)" — (X" +1) and let w = (—1 + /—3)/2 be 
a primitive cubic root of 1, hence w? +w+1= 0. Then G,(w) = 
(w 4- 1)" — (w^ 4- 1) = —(w?" + uw" 4-1) =- (w — 1)/(w" — 1) = 0. 

So G,,(X) is a multiple of the minimal polynomial X? + X +1 of 
w. Explicitly, by division, G,(X) = F(X)(X?+ X 4-1) + (aX +b) 
where F(X) € Z[X] and a,b € Z. So 0 = Galw) = F(w)(w? +w + 
1) + aw + b = aw + b, hence a = 0 (otherwise w = —b/a € Q, which 
is not true) and also b = 0. 

Next, (X? + X + 1)? divides G,,(X) if and only if w is a double 
root of G,(X), that is, w is a root of the derivative G,'(X) = n((X + 
1)! — X"-1), i.e, n[(w -- 1)^7! —w""] = 0. Since w + 1 = —w?, 
this is equivalent to (w”~! — 1)w"~! = 0, that is, w”! = 1. Finally, 
this holds if and only if 3 divides n — 1, son = 1 (mod 6). 

We show that (X? + X + 1)? does not divide G,(.X). If it did, 
then n = 1 (mod 6) and moreover X? + X + 1 divides G,"(X) = 
n(n—1)((X--1)"?— X"-?). So G,"(w) = 0, hence (w--1)"? = w 2? 
and —w("-?2) = u^-2, Therefore w"-? = —1 and «?("-2 = 1. This 
implies that 3 divides 2(n — 2), so n = 2 (mod 3), hence n = —1 
(mod 6), which is a contradiction. 


We may therefore write: if n = +1 (mod 6) then 


(2.1) 
(X + Y)" - (X° c Y") = XY(X +Y)(X? + XY +Y’)°E, (X,Y), 


and if n — p » 3 is a prime then 


(2.2) 
(X +Y) — (X? - Y?) = pXY(X + Y)(X? + XY +Y7)°C, (X,Y), 


where E,(X,Y) = pC,(X,Y) and e = 1, or 2, according to n = 
—1, or 1 (mod 6). 

There are numerous proofs of the above result (or variants of it) in 
the literature, to wit by Cayley (1878), Glaisher (1878, 1879), Muir 
(1878), Catalan (1884, 1885, 1886), Lucas (1888, 1891), Barisien 
(1906), Taupin and Retall (1907), Ursus and Grigorieff (1907), Can- 
dido (1907), and Bréié-Kostié (1952). 

In 1878, Glaisher expressed the above result in the following form: 


(2B)  Ifn is odd then (X — Y)" -- (Y — Z)” -(Z — X)" is divisible 


222 VII. Interludes 7 and 8 


by 


H(X Y? + (Y - Z} + (Z — El 
If n= —1 (mod 6) then the above polynomial is also divisible by 


IX — YY (Y —- Zy + (Z EL 


1 
2 
Ifn=1 (mod 6), then it is divisible by 


IX =- Y)t+ (Y - Z) + (Z - X)“. 


PROOF. Let A, B be indeterminates and let n > 3 be odd. Then 
AB(A + B) divides (A + B)” — A” — B”. 

Let A = Z-Y, B = X -Z so A+B = X-Y. Then AB(A+B) = 
(Z—-Y)(X-Z)(X-Y)- SR -Zy-HZ-XP-X-Yy] divides 
(LK —Y)"4+(Y —-Z)"+ (Z-—X)”. Next, 


A+AB+B = (Z-Y)?+(Z-Y)(X —Z)+(X -Z) 
= i(Y -Z)?+(Z-X)?+(X-Y)’] 
and 
(A? + AB + BI SR 


IL 


-(X-Y 
-(X-Y 


Hence by (2A) if n = —1 (mod 6) then 1[((X — Y)? + (Y — ZP 4 
(Z — X)?] divides the given polynomial while if n = 1 (mod 6) then 


E(X —Y)'--(Y — Z)* - (Z — X)*] divides the given polynomial. 


The following special result was explicitly given by Catalan (1884, 
1885), Gérono (1885), Nester (1907), Welsch (1909), and Brocard 
(1910): 


(2C) Let p be a prime number. 
(1) If (X4 Yy - X» -Y? = pXY(X +Y)P?, where P € Z[X, Y] 
then p=3, P=1, orp=7, P= X? 4 XY &- Y?. 
(2) If 2-1 —1 = pN? where N is an integer then p = 3, N = 
lorp=7,N=3. 
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PROOF. We establish the two statements simultaneously. First we 
observe that p #4 2. Taking X = Y = 1 we have 2? — 2 = 2pN?, 
where N — P(1,1) € Z. 

If p = 3 then N = 1. We assume now p Z 3. We have (2(-9/? + 
1)(20-0/2 —1) = pN?. The two factors in the left-hand side are odd, 
hence they are relatively prime. So, one of the factors is a square, 
while the other one is p times a square. 

But 2@-)/2 — 1 = 3 (mod 4) since (p — 1)/2 > 2, so 2@-D/? — 1 
is not a square. Hence 2(?-9/? +1 = M?, M being an integer. Since 
20-07 = M? — 1 = (M — 1(M +1), it follows that M — 1, M +1 
are powers of 2. But (M +1) —-(M — 1) =2, so M -1—2, M =3 
and necessarily p = 7, N = 3. Therefore 


(X Y) Ey 
7XY(X +Y) 


P? = 


= (X? + XY + Y?y. 


The polynomial C,(X,1) will be simply denoted by C,(X) and 
called the Cauchy polynomial for the prime p > 5. If p = 6k +1 then 
C,(.X) has degree 6(k — 1). We note the following special cases: 


C5(X) = 1, 

Cz(X) = 1, 

Cu(X) = X° 3X5 +7X 9X? + 7X7? + 3X +1 
= (X? + XY +Y’ e [XY(X +Y), 

Cis(X) = X°4+3X° 48X44 11X? 8X? 43X41 
= (X? + XY - Y?y +2[XY (X + VUE, 


These expressions in terms of XY (X +Y) and X? + XY +Y? will 
soon be generalized for arbitrary values of p. 

The Cauchy polynomials satisfy the following properties, where 
p = 6k +1 (see Mirimanoff (1903), Klósgen (1970)): 


(2D) 


( 
C,(X) = C,(-1 — X). 

(2) C,(0) = C(-1) = 1. 
l 


X) has no real roots. 
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k — 1 disjoint sets, each composed of six distinct roots 


E =} 
(EEN z i 


EE 
2, =; — Z), H 
1l+z 


PROOF. (1) Since 


0 (X41»-X?-1 
One): pX(X +1)(X2+X+1)¢ 


(with e = 1 or 2) then C,(1/X) = X-P+%#3C (X). From 


_ f 1 when p-—6k-—1, 
7 12 when p=6k+1, 


it follows that C,(X) = X9*-9C,(1/X). Similarly C,(X) = C,(-1 
—X). 

(2) Since C,(X) is a monic and symmetric polynomial then 
C,(0) = 1. Also by (1), C,(—1) = C,(0) = 1. 

(3) Ifz>0 isa real root of C,(X) then (z+1)? = z? +1, which 
is impossible. If z < —1 is a real root of C,(X) then —(1+ z) > 0 
would be a positive real root of C,(—1— X) = C,(X), which is a 
contradiction. Similarly, if —1 < z < 0 is a real root of C (X) then 
1/z < —1 would be a real root of C,(X), which is impossible. 

(4) Let z be any root of C,(X). By (1) it follows that 1/z 
and —(1 + z) are roots of C,(X), hence — (1 + 1/z) = —-(2+1)/2, 
—1/(1+ z) and — z/(z + 1) are also roots of C,(X). The sets of 
roots considered above are either equal or disjoint. Indeed, if 


1 z =) 
1-z' 14+2’ z 


1 
te M = fz, ,—(1- z), 
z 


it is easy to verify that M, = M,. 
Let z be an imaginary root of C, (X), and suppose it is a double 
root; this happens if and only if z is a root of Cj. Since 


; oe 1 ere e 
C;(X) = 6(k — 1) X8*- IC (=) =X aman OF (=) 


and C;(X) = —C;(—1 — X), it follows that 1/z and —1 — z are also 
double roots, hence each element of the set M, is also a double root. 
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From (X +1)? — X? — 1 = pX(X+1)(X7+X+1)°C,(X), taking 
derivatives we have 


(X +1) X?! = (X(X +1)(X? + X +1) C, (X) 
+ X(X +1)(X? + X LUNCH, 


Hence (z + 1)?7! = 271, so (1+1/z)’* = 1, that is, 1 + 1/z is a 
(p — 1)th root of 1, so |1 + 1/2| = 1. 

Beginning with the double root —z/(1 + z) (instead of z), we 
deduce that 1+ (—(1+z)/z) = —1/z is also a (p— 1)th root of 1, and 
|1/z| = 1. Thus, the triangle with vertices 0,1,€ = —1/z has sides of 
length 1, so it is equilateral, therefore € is a primitive sixth root of 1. 
So £?— ¿+1 = 0, and EI = —1. Then —(z+1)/z = -1—1/z = —1 +£ 
satisfies 


(—1+6) = -14 3€ — 367 + & = —1 +3 —-3€4+3-1=1, 


so its minimal polynomial is X? + X + 1. Since —(z + 1)/z is also a 
root of C,(X) then X?-- X +1 divides C, ( X), contradicting (2A). 


Mirimanoff also investigated in 1903 whether each polynomial 
C,(.X) is irreducible, and he conjectured this to be true. Klósgen 
(1970) has verified with a computer that if p < 31, the Cauchy 
polynomial C,(X) is an irreducible polynomial of Z[X]. Since the 
maximum of the coefficients of C,(.X) grows very fast with p, the 
test was not continued any further. 

We report without proof the following results, due to Helou, 1997: 

If n > 3, n odd, then for every prime p, the polynomial C,, modulo 
p is reducible over the field with p elements. Moreover, if n is an odd 
prime and for some prime p, Can modulo p has at most three factors, 
then C, is irreducible. Helou attributes to Filaseta the proof that 
C», is irreducible for every odd prime p; a proof is given by Helou in 
his paper (1997). More results about the Cauchy polynomials are in 
the paper by Terjanian (1989). 

We shall need another expression for the Cauchy polynomials. 
More generally, we shall express $,(X,Y) = (X +Y)"+(-1)"(X"+ 
Y") as a polynomial in U = X? + XY + Y’ and V = XY(X +Y). 

There are two methods to achieve this result. 
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We may consider the polynomial of degree 3 in the indeterminate 
T, with coefficients in Z[X, Y], whose roots are X + Y, — X, -Y: 
F(T) = (T+ X)(T+Y)\(T-X-Y) 
= TS + aT? + aT + Q3, 
with 
a,=X+Y-(X+Y) =0, 
dg = XY —-X(X+Y)-Y(X+Y)=-(X2+XY +Y?), 
as = —XY(X +Y). 
Then S,(X,Y) = (X + Y)" + (-1)^(X* + Y") is the sum of the 
nth powers of the roots of F(T). We shall use the following classical 
expression, due to Waring (1782), for the sum 


Pn = 2I bet pen (n=0,1,...) 


of the roots of a polynomial 


k 
p digne +. + Ap = J[T- £i). 
i=1 

The special cases n = 1,2,3,4 were known to Girard (1629); see 
also Saalschütz (1906). The proof may be found in Serret (1885), 
Lucas (1891), Perron (1951). A modern algebraic proof was given 
by Rédei (1952, 1959). Another proof, using power series, is given 
by Cesàro and Kowalewski (1904). 


LEMMA 2.1. With above notations: 


Pn = nT] D(a teca) Ta; 


i=l 


where {(a,+---+@x)'}n is the sum of all monomials (il/(il---281)) x 


a? «af (with i, +++» +i, = à, 0 X iy,... ip) having weight equal 
to n, that is, à, + 2ig +--- + ki, =n. 


The next result was proved for even exponents by Ferrers and 
Jackson (1852; see reference in Dickson’s History of the Theory of 
Numbers, Vol. II, p. 747); the proof for arbitrary exponent appears 
in Todhunter's book (1861). It was proved again by Muir (1879), 
using the above lemma. Kapferer (1949) rediscovered it, with the 
same proof, when n is a prime. In 1969, Carlitz and Hunter gave a 
proof with the method of power series. 
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(2E) Letn >2, let U = X?+ XY +Y?, and let V = XY(X+Y). 
Then 
OEY P(x)" 4+ EY)” 


= n r 3r—n4-3yrn—2r—2 
3 3 xí. 3 hh us 


(summation for max(0, (n—3)/3} <r € (n—2)/2, with the conven- 
tion that if n is even and r — (n —2)/2 then the coefficient of the 
summand is equal to 2). 


PROOF. FIRST PROOF (Muir). By the above lemma 
CX EX) VECSX) CY) 


= CU (ea. 


KO 3 3 
CE) d i2 | da 
=n ; ^ 
2. L E iali Sa 
2i54-3i3—n 
TL 
(i2 + 73 — 1) 
— —1y 12 gis 
2 / ) P334 "SEE 23 
2i5--3ig—n 
(ig +i — 1)! i i 
DI 


(sum for all i2, 43, such that 0 < io, 13: 1 < 49 + i3; 2i2 + 373 = n). But 
—a2 = U, —a3 = V and 


(i--i4—1) 1 p 


inl ig! alt 


(valid when i2 £ 0 or i3 £1). Let i2+i3 —1 = r, hence n —2—2r = 
(245 + 33) 2(i2 I 13 1) I= 13. Therefore 12 = 3r -n+ 3 and 


(X +Y)"+ (-X)"+ (-Y)" 


E n T 3r—n-F3Y7rn—2r—2 
"Xie sj" d | 


the sum being extended for max (0, (n — 3)/3) € r € (n—2)/2. 
Moreover, if n is even and r — (n — 2)/2 then the corresponding 
summand has coefficient 2 (since i2 = n/2, i3 = 0). 
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SECOND PROOF (CARLITZ & HUNTER). This proof does not 
require Waring’s lemma. We consider the identity 


Z | X x Y 
LEZ LES 1-YW 
(X+Y +Z)-2(XY +YZ+ZX)W +3XYZW? 


E EE EE XYZW®' 


Let Z = -(X + Y), so XY - YZ + ZX = -U, XYZ = -V and 
changing signs: 


(2.3) 
(X + Y) X Y | -2UW -3VW? 
1+(X+Y)W 1-XW 1-YW 1-UW?-HVWS*' 


The formal power series expansion of 1/(1 — UW? + VW?) is 
1 1 


1I-UW -VW*3 | 1-W*(U-VW) 


Y W*(U-VWy 


r=0 


j» Ww? SÉ (") gw 


r=0 s=0 


ae = _1)% n T 3r—ny;n—2r 
2 1)°W V 


r=0 


(this last sum is actually for n/3 € r < n/2; by convention the other 
summands are zero). The left-hand side of (2.3) is equal to 


yop? A vnm (ayn py. 


n=0 
The right-hand side of (2.3) is equal to 


oo 


_] n+1l n-4-1 2 T 3rrl—nyrn—2r 
Seam eh 


n=0 
= 1 n+2 wr 3 r U?r-^"yntl-2r 
FICUTI Aaa 


VII.2. The Cauchy Polynomials 229 


Sci orale Ge Pe nyn- 2r 


n=0 r=0 


Ex n4-1 n+1 r 3r+l—-ny7n—-—2r 
+> yer 3 Mam V 
—2UW --3VW? — 2U?w? 


_ n +1 r 
E S1) Dti e | U3r*i-nyn-?r 
e n—1-—2r 


n —2r 


= —2UW + 3VW? — An: 
- X n+l r 
2e n n 3r-J-2—nY7zn—1-—2r 
ips ip renal La] V 


son= d= 2r 


because 


9 T 43 T | nt+2 T 
n — 2r n—1—2r] n—2r\n—1-2r 


(for n > 3). On the other hand, the left-hand side of (2.3) is equal 


= Sey + Cxy + CYP 


Comparing the two sides of (2.3), as computed above, yields the 
statement. 


We note the following special cases. Taking an odd exponent, 
(2.4) (X + a ee — X?2k+1 = y2k+1 


= x 2k +1 T Ur 2k H2 2k-2r-1 
2k—1—2rV2k —2— 2r f 


2(k-1)<r<k-1 


If p is a prime, p Z 2,3, letting p = 6k +1, we have (see F. Lucas, 
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1897, and Kapferer, 1949): 


(2.5) (X+Y)? — X? — Y? = pXY(X+Y)(X°+XV+Y’) 
x ix €" ü Ji ye-n/2-3i yi 
= 2i 2i 1 
when p = 1 (mod 6), and 
(2.6) (X Y) — X? YP  pXY(X - Y)(X? + XY - Y?) 
x Ke KS E jas U (P-5)/2-3i y 2i. 
= 2i 2i+1 
when p = —1 (mod 6). If the exponent is even, we have 


(2.7) (X +Y) + X?* +Y” 


= k r 3r—2k-4-3172k—2r—2 
B x k—1 Aë 3 DE d | 


(2k—3)/3<r<k—1 


In 1879, elaborating on previous papers of Glaisher (1878, 1879), 
Muir found the following recurrence relation and divisibility proper- 
ties of the polynomials S,(X, Y): 


(2F) 
(1) VS,(X, Y) + USp41(X,Y) = Sn43(X,Y) forn > 0. 
(2) 65,(X, Y) p 3S2(X, Y}Sn-2(X,Y) E 285(X, Y )Sn-3(X, Y) 


forn> 3. 

(3) If n 2 0 (mod 6) then U,V YS,(X,Y). 
Ifnz1 (mod 6) then U?V | S,(X,Y). 
If n =2 (mod 6) then U | S,(X,Y). 
If n =3 (mod 6) then V | S,(X,Y). 
If n z 4 (mod 6) then U? | S,(X, Y). 
Ifn=5 (mod 6) then UV | S,(X,Y). 


PROOF. (1) Let n = 2m. By (2.4) and (2.7) and noting that 


2m r—i E 2m + 1 r—i 
2m — 2r \2m — 1— 2r 2m + 1—2r \2m — 2r 


" 2m + 3 T 
| 2m -1— 2r \2m — 2r 
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it follows that 
V Som(X, Y) + USam4i(X, Y) = Som+3(X, Y). 
Taking n = 2m + 1 and proceeding similarly, we deduce that 


VSom41(X, Y) +U Som42(X, Y) = S2m44( X,Y) 


and this proves (1). 
(2) Noting that S4(X, Y) = 2U and S4(X,Y) = 3V, it follows 
from (1) that 
8S2(X,Y )Sn-2(X, Y) + 253(X, Y)S,-3(X, Y) 


(3) This follows immediately from (2.4) and (2.7). 


Using an extension of Waring’s formula, MacMahon indicated in 
1884 some more algebraic identities of the same family. 
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VIII 
Reformulations, 


Consequences, and Criteria 


In this chapter we give a variety of results, a good indication of the 
wide search for solutions of Fermat’s problem. There are reformula- 
tions into equivalent problems, also a number of consequences of the 
truth of Fermat’s theorem as well as statements which follow from 
the assumption that the theorem is false for some exponent. 


VIIL1. Reformulation and Consequences of Fermat’s Last The- 
orem 


In this section, we shall indicate some propositions which may be 
proved if we assume the truth of Fermat’s last theorem. Among 
these propositions, some imply, conversely, the truth of Fermat’s 
last theorem. 


A. Diophantine Equations Related to Fermat’s Equation. 


There have been many instances where certain diophantine equations 
were compared to Fermat’s equation. We describe in succession the 
results originated with Lebesgue, Christilles, Perrin, Hurwitz, and 
Kapferer. We also discuss briefly the equations of Frey which played 
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a central role in the recent approach and solution of Fermat’s prob- 
lem by Wiles. 


Al. Lebesgue 


The first result of this kind in the literature is due to Lebesgue in 
1840. It was proved again by Terquem in 1846 and rediscovered by 
Pocklington in 1913: 


(LA) If Fermat’s last theorem is true for the exponent n > 3 then 
the equation X?" + Y?" = Z? has only trivial solutions. 


PROOF. Suppose that x,y,z are nonzero positive integers such that 
xz?" + y?" = 2?. Tt is easily seen that we may assume without loss of 
generality that x,y,z are pairwise relatively prime. Moreover, x, y 
cannot be both odd, otherwise z? = 2 (mod 4), which is not true. 
So, for example, x is even, y is odd, hence z is odd. 

Let x = 2?z', with a > 1, z' odd, z' > 0. Then (z+ y”)(z—y”) = 
2? — = g?” = Honn. We note that ged(z +y”, z — y”) = 2, so 


qnem oreo, 
z SS y” — DANS TER 


with r, s odd, positive, gcd(r, s) = 1. 
Adding and subtracting, we obtain 
z= pen Sec, 
9 


TL 


duy = r?” — DR gon — (pr — gancl gnum 4 201gr), 
Since ged(r” — 24-157, p^ + 2%-1s”) = 1 then 
{ pe + 2l mm. E> 0; 
r” — 201g = +u”, a >. 
Therefore 


1" Tc u” — gone — (2*s)". 
By hypothesis, we must have u = 0, hence r” = 2?"-1s" is odd, 
so an — 1, which is a contradiction. 


As a corollary, Liouville proved in 1840 (see also Terquem, 1846): 


(1B) If Fermat’s last theorem is true for the exponent n > 3 then 
the equation X?" — Y?" = 2Z" has only trivial solutions. 
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PROOF. Suppose that x,y,z are nonzero integers such that xz?" — 
y?" = 22". Let t = y?" + z^, so t — z^ = y?" and also t+ z* = x”. 
Hence t? — 2?" = (rain, so 27” + Lewin = t?. 

By (1A) we must have t = 0, and then necessarily z = 0, zy = 0, 
which is a contradiction. 


A2.  Christilles 


'The next result was proved by Christilles in 1967. We begin with 
a lemma: 


LEMMA 1.1. The equation X? 4- Y? + Z? = 3XY Z has a solution in 
nonzero integers x,y,z if and only if x +y+z=0 orzrt=y=z. 


PROOF. We have the identity 


(11) X? +Y?’ + Z? -3XYZ 
= LN HL SINT +Y? +Z? - XY -YZ —-ZX). 


If z, y, z are nonzero integers then z? + y? + 23 = 3xyz if and only if 
z+y+z=0or r? +y? +2 = ry +yz + zx. 

However, for any integers x,y the equation Z? — (x + y)Z + (a? + 
y? — zy) = 0 has solutions z = ((x + y) + (x — y) V —3)/2, which are 
integers exactly when x = y, and in this case x = y = z. 


(1C) Letn 23. The following statements are equivalent: 


(1) Fermat's last theorem is true for the exponent n. 
(2) The only solutions in nonzero integers x,y,z of X?" + Y?" + 
Z3 eua yn Z" arexz=y=z. 


PROOF. (1) — (2) Assume that z, y, z are nonzero integers which 
are not all equal and such that x°?” + y?" + 29" = 3x”y”z”. By the 
lemma we have z" +y” + 2” — 0. 

(2) — (1) Suppose that x,y,z are nonzero integers such that 
x” +y” +z” = 0; then x,y, z cannot be all equal and by the lemma, 
qn + y? + PU = Bays”. 


Christilles has also indicated in the same paper the following suf- 
ficient condition for Fermat’s theorem: 
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(1D) Letn>3. If X?+Y"4+ Zn = 5 X"ny"Zn(Z?n — XY") 
has no solution in nonzero integers, then Fermat’s last theorem is 
true for the exponent n. 


PROOF. Suppose that x,y,z are nonzero integers such that xz" +y” + 
z" — 0. Then 


—29^ = (g^ py")? = a9" ty E Bx" y" (a° $y") - 10x?" y?^ (z^ Ey"). 
So 
q^ py? + 29» = 5r y" [x9^ + y^ — 33^ y^ z^ + ay z^]. 
By (1.1) and the hypothesis z?" + y?" + 2?" = 3z"y"z". hence 


xr + y?" + 2°" = Gal — oy” with x,y,z Æ 0, proving the 
result. 


A3. Perrin 


Perrin showed in 1885 the following fact concerning the cubic Fer- 
mat equation: 


(1E) The following statements are equivalent and true: 


(1) Fermat's last theorem is true for the exponent 3. 
(2) For every n 2 1 the equation 


(1.2) X? + Y? 43312 = 2 x 3 XYZ 
has no solution in nonzero integers x,y,z, not multiples of 3. 


PROOF. (1) — (2) Suppose that there exists an integer n > 1 and 
nonzero integers z, y, z, 3 / xyz, satisfying equation (1.2). Let 


Since x,y,z # 0 and 3 / xyz then a,b,c Z 0. From (1.2) we have, 
taking the square and the cube 


ot yo Län ef ear 42 x 3391 73 2342 x 380 Larei = 4x3" 32, e? 
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and 
L? + y? +3732 4 33999 + Ba + 339523 
42 x 390434323 4 330623 4 360-1736 4 36n-1,,3 6 

= 8 x 3°" gz y3z3. 
Next we have 

a? = HR 2S ur ae um z? + 3t gtyz SE a), 

b? = y? (apo = aere + any iz _ y5), 

ci = ganan y? = EEN + Br ue - gu» 
Adding these equalities, taking into account (1.2) and the preceding 
relations, leads in a straightforward manner to 

a? + b? + c — 0. 


(2) — (1) Suppose that there exist nonzero integers a,b,c such 
that a? +b? -- c? = 0; we may assume a, b, c pairwise relatively prime. 
From the identity 


(X-Y-c-Z-2X?-Y? + Z2 -3(X - Y)(Y - Z(Z 4 X) 
it follows that 
(a 4- b -- c? = 3(a 4- b) (b -- c)(c +a). 


But a+b, b+c, c+a are also pairwise relatively prime; for example, 
if a prime p divides a + b and also b+ c then since p divides a 4- 6 4- c 
it would divide c and a, contrary to the hypothesis. 

Hence, one of the factors, say a 4- b, is a multiple of 3, and neces- 
sarily there exist integers n > 1, x,y,z such that 


a +b = 3125, 
b+c= x, 


and 3"xyz = a +b+c. Hence 
e+ y? + 39^7173 = 2(a + b+ c) = 2 x 3"xyz. 


Finally, x, y, z Z 0 (if, for example, z = 0 then a = —b so a = —b = 
+1 which implies c = 0) and also 3 / xyz because a + b, b+c, c+a 
are pairwise relatively prime. 
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A4. Hurwitz 


In 1908, Hurwitz considered the diophantine equation 
(1.3) X™Y"+Y™Z" + Z”X” =0, 


where m > n > 0 and ged(m,n) = 1, without loss of generality. He 
proved: 


(1F) The above equation has only the trivial solution if and only 
if Fermat’s theorem is true for the exponent m? — mn + n°. 


PROOF. Let x,y,z be nonzero integers such that ui + y"z^" + 
zx" = 0. We may assume gcd(z, y, z) = 1. 

Let u = gcd(y, z), v = gcd(z,x), w = gcd(x,y). Then u,v, v are 
pairwise relatively prime. Hence vw divides x and we may write 
xz = zıvw, x, an integer. Similarly y = ywu, z = zuv with y1, 21 
integers. Moreover, z,,5,,2; are pairwise relatively prime. Substi- 
tuting in the equation and dividing by u”v”w” we have 


eye "w^ +y zw u + za ur v = 0. 
Thus u™—” divides zf*yjv"-"w". But gcd(u,v) = 1, gcd(u, w) = 1 
and gcd(u,x) = 1, hence gcd(u,z,) = 1. So um" divides y}. On 
the other hand, y} divides zj"xztu" ^" v". Since yi is relatively prime 


to æ1,21, and v then y? divides u^". Hence y} = Xu". 

In the same way x? = tw™”, zi = v". Since ged(m,n) = 1 
then ged(m,m — n) = 1, hence zı = wP”, y = up”, z1 =v” 
for some integers ui, v1, W1 such that u = tu}, v = tui, w = twi. 


It follows that 


wi "es m E Son ab ut mmn n)n, e TUR i 


£ VETO yny m Tom du eux 


mn n? mn n? mn 


Multiplying with u% "yr wj , and noting that m? — 
mn + n? is odd, we obtain the equation 


wp mee + (tuyere + (zv, m! omnee? = 0. 


Conversely, if the nonzero integers u,v, w satisfy the relation 


um mnm? + ym omne? + wm wnt? ES 0, 
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mn mn 


multiplying with u""o""4)"^. we obtain 


um te Mnn Lg + mn mn y am + mn, mn = (. 


U uw uy 
Letting 
L=v"w"™, 
pa n,,nm. 
y-—w'u, 
z = u"v”, 


we obtain xy” + y™z” + 2027 = 0. 


As a special case, letting m = 3, n = 1, it follows that 
MY +Y°Z+ZSY =0 
has only the trivial solution. 
Ab. Kapferer 


In 1933, Kapferer published a proof that Fermat's last theorem is 
true for the exponent n > 3 if and only if the equation 


(1.4) Z Y? = 3? x 22x 


has no solution in nonzero pairwise relatively prime integers x, y, z. 
The proof of Kapferer contained a flaw, partly corrected by Riben- 
boim (communicated to Inkeri). Inkeri found and proved (correctly) 
the results which we give below. Comments by Gandhi and Stuff 
(1975) on this matter were inaccurate. 
We begin with the special case n = 3, which was given by Fueter 
(1930): 


(1G) The equation 
(1.5) Z—y^-89.*5 Xx 
has no solution in nonzero integers x,y,z with gcd(y, z) = 1. 


PROOF. Assume that z,y,z are nonzero integers such that 
gcd(y, z) = 1 and 
Saf SB ND s, 


Let 


v = 6723 — y, 
w = DIS, 


| u = 6723 + y, 
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Then u,v,w € Z, w #0. We show that u £ 0,v Z0. Ifu=0 
then Gil = —y, so y # +1. If p is a prime dividing y, then p | 6x. 
So p | z, which is contrary to the hypothesis that gcd(y,z) — 1. 
Similarly, v = 0. 

Finally, u? + v? = (622? + y)? + (6723 — y)? = 2 x 68a? + 62339? = 
6333 (2 x 6336 + y?) = 6%x%2% = w?. This is, however, impossible. 


It may be shown in the same way that there are no integers 
2,9, 2,4 0 with gcd(y, z) = 1 such that z? — 3y? = 242°. 

We give now Inkeri’s equivalence, which is a corrected form of the 
one previously given by Kapferer (1933). It also involves the related 
equation 


(1.6) VAE) 299 X. 


(1H) ` Let n 2 3 be an odd integer. The following statements are 
equivalent: 
(1) Fermat's last theorem is true for the exponent n. 
(2) The equations (1.4) and (1.6) have no solution in nonzero 
integers x,y,z with gcd(y, z) — 1. 


PROOF. (1) = (2) We assume that there exist nonzero integers 
æ,y,2 such that gcd(y,z) = 1 and z? — y? = 3? x 2?^—?g?^. Let 


a=y, 
b=3 x Qrlyr, 


C= 2, 


so a? +3b? = el, Then c is odd and ged(a, b) = 1 because gcd(a, c) = 
gcd(y, z) = 1. 

By Chapter I, Lemma 4.7, there exist integers r,s 40, gcd(r, 3s) = 
1, r Æ s (mod 2), such that 


a = r(r? — 9s), 
b = 3s(r? — s?), 
c — r? + 352. 


Then 
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Since x Æ 0, then s, r — s, r+ are nonzero. Also s is even, because 
r Æ s (mod 2). Since gcd(r, s) = 1 then s, r — s, r + are pairwise 
relatively prime. So there exist nonzero integers u,v,w such that 
2s = w”, r — s = u^, —r — s = v". So u” +v” +w” = 0. We note 
that u,v, w Z 0 and gcd(u, v, w) = 1. 

Now we assume that there exist nonzero integers x, y, z such that 
gcd(y, z) = 1 and 23 — 3y? = 2?"-1y?". Let 


— 9n—1l,.n 
aS SET, 
b=y, 

C= 2, 


so a? + 3b? = œ. Again c is odd and gcd(a, b) = 1. 
By Chapter I, Lemma 4.7, there exist nonzero integers r,s, 
gcd(r,3s) = 1, r Z s (mod 2) such that 


a = r(r? — 98°), 
b = 3s(r? — s?), 
c= r? + 352. 


Then 
2^-1g^ = a = r(r — 3s)(r + 3s). 


We note that r is necessarily even, since r — 3s, r + 3s are odd. So 
r,r — 3s, r + 3s are pairwise relatively prime. Hence there exist 
nonzero integers u, v, w such that 2r = w”, —r+3s = u”, —r— 3s = 
v" and so u” +v” + w” = 0, with u,v, w Z 0, gcd(u, v, w) = 1. 

(2) = (1) We assume that u, v, w are nonzero integers such that 
u^ + v" +w” = 0. Without loss of generality, we may assume that 
u, v, w are pairwise relatively prime, w is even, and u,v are odd. Let 
s=w"/2andr=u"+s. Hence 2s = w”, r — s = u”, —r— s — v^, 
Let 

a = r(r? — 98°), 
b = 3s(r? — s?), 
c= 7? + 3s’. 


By substitution, we obtain a? + 3b? = œ and b = 3s(r? — s?) = 
—3 x 2"-1 (Iuvw)'. Let 

x= zwu, 

y =a, 

eme 
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so 2% — y? = 3% x An Zorn. Since gcd(u,v) = 1 then gcd(r,s) = 1 
and x # 0 by assumption. Now w is even, then s is even, so y is odd, 
hence y Z 0. Next, 3 / uvw if and only if 3 | r. Indeed, if 3 Y uvw 
then 3 Fair? — s?), so necessarily 3 | r, and conversely, if 3 | r, from 
gcd(r, s) = 1 then 3 J/s(r? — si), so 3 f uvw. 

Now we show that gcd(r,y) = 1 which will be done in several 
steps: 

(a) Since s is even and ged(r, s) = 1 then r is odd. Then gcd(r? — 
s?, r? — 9s?) = 1. Indeed, if p | r? — s, and p | r? — 95? then p Z 2 
and p | 8r?, p | 857, so p | gcd(r, s) = 1, which is impossible. 

(b) ged(r,r? — s?) = gcd(s, r? — 9s?) = 1 because gcd(r, s) = 1. 

(c) Then gcd(a,b/3) = 1. Since x | b/3 and y = a then 
gcd(z,y) = 1. Since y is odd, we also have gcd(2z, y) = 1. Next 
we have: gcd(a,b) = 1 or 3. More precisely gcd(a,b) = 1 when 
3 / r and gcd(a, b) = 3 when 3 | r. We note 3 | r if and only if 
3 | r(r? — 9s?) = a and this is equivalent to 3 /uvw. 

Also gcd(y,z) = gcd(a,c) = 1 or 3 according to 3 Jr or 3 | r. 
Indeed, if p is a prime, e > 1 and p° | gcd(a, c) then from a?+3b? = &?, 
p? | 3b? so p | b, hence p = 3, gcd(a, b) = 3 and 3? Yc, showing that if 
gcd(a, c) Z 1, then gcd(a, c) = 3 and 3 | r. Conversely, if 3 | r then 
gcd(a, b) = 3, so 3? Fe and by the above, gcd(a, c) = 3. 

If gcd(y,z) = 3 then 9 | y. Let xı = z, yı = y/9, z = 2/3, 
then 2? — 3y? = 227-472" with x1, y1, 21 Æ 0, ged(yi,z1) = 1. This 
concludes the proof. 


For further use, we note that (with the above notations), gcd(2z, y) 
= 1. Here is a related result: 


(11) ` Letn>3. The following statements are equivalent: 


(1) Fermat’s last theorem is true for the exponent n. 
(2) The equation (1.4) has no solution in nonzero integers x,y, z 
such that gcd(2x, y) = 1. 


PROOF. (1) = (2) Assume that x, y, z Z 0, gcd(2x, y) = 1 and z?— 
y? = 3° x 2?"-!5?". Tf p is a prime and p | gcd(y, z) then p 2x, so 
p = 3, so 3? | y?, hence 9 | y. Let xı = x, y = y/9, 21 = z/3, 
so 23 — 3y? = 2?^^ 1g?" with 21, 41,21 Æ 0, gcd(y1, z1) = 1. By the 
implication (1) — (2) of (1H) then there exist u,v,w 4 0 such that 
u^ cr v" +w” = 0. 
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(2) = (1) Assume that u,v,w 4 0, gcd(u,v,w) = 1, and u” + 
v" +w” = 0. By the proof of the implication (2) — (1) of (1H) and 
the remark following this proof, there exist x, y, z 4 0, gcd(2z,y) = 
land 2) — y? = 38 x 2m AECH, 


We examine in more detail the situation: 


(1J) Assume that (1.4) has no solution in nonzero relatively prime 
integers. If u,v, w #0, gcd(u, v, w) = 1, and u” + v” = w”, then: 
(1) n is odd, u = v = —w (mod 3), 3 f uvw. 
(2) If p is any prime dividing n, then p = 1 (mod 6). 


PROOF. (1) By (2) = (1) in (1H), there exist nonzero integers 
x,y,z such that z? — y? = 3? x 2?^^!z?" with gcd(y, z) = 3 and also 
3 fuvw. 

By Chapter VI, (4L), n is odd, hence u” + v" + (—w)" = 0. Let 
r= u”, s = —v", so r,s Æ 0,r Z s (since w #0). We haver Z s 
(mod 3), since 3 yw. Then u = u” = r Z s = —v" = —v (mod 3). 


Since 3 Y uv then u = v (mod 3). By symmetry, u = v = —w 
(mod 3). 

(2) Let p be a prime dividing n, n = pt, u; = u*, v4 = vf, un = 
wt, so ul + uf + (~w)? = 0 with ged(u, v1, w1) = 1, u4 = v = ~w: 


(mod 3). By Chapter VI, (2A), we have p = 1 (mod 6). 


It was shown by Inkeri, using nonelementary methods (namely 
class field theory), that under the above circumstances, 3? = 3 
(mod p?); this congruence is very seldom satisfied by a prime p. 

It is appropriate to indicate now some facts about the equation 


(1.7) Z-Y? =c. 


(For more details, see my book, Ribenboim, 1994.) Euler proved 
in 1738 that if c = +1 the only solution of (1.7) in integers greater 
than 1 is y = 3, z = 2. It was shown by Siegel in 1929, using deep 
analytical methods, that for each given c there exist at most finitely 
many solutions for equation (1.7). 

Using (1H) we may show: 


(1K) If ce = 8*x2"-'u" where n > 3, x = 1, p or pq (for 
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distinct primes p,q), then equation (1.4) has no solution in nonzero 
relatively prime integers y, z 


PROOF. Assume that y, z are nonzero relatively prime integers such 
that 2? —y? = 33x27 222%, By the proof of implication (1) = (2) of 
(1H), we have 2"-!z" r(r?—5?) with r,s Z 0, r Æ s (mod 2), gcd(r, s) 
= 1. Then gcd(r, r? — s?) = 1 and clearly r? — s? Z 1. 

If x = 1 then r = +1, r? — s? = Fän, so s? = 2%"! + 1, which is 
impossible. 

If x = p then 2"~'p" = r(r? — s?). If p = 2, the above argument 
leads to a contradiction. If p Æ 2, then either r = 2" !, r? — s? = 


p", or vice versa. In the first case, r — s = +1,r+ s = +p” (or 
r — s = ctp", r + s = +1); in all cases, we reach a contradiction like 
Dec (y^ «i 1) or similar relations. 

If x = pq and ps Or 2 is 2, we use the preceding argument. If Prd 
are odd then 2"-1p"q" = € —s)(r+s). If r = +p” thenr— s = 
c2"! rts= ig i Fq”) or vice versa, and 2p" = +(2"-' +q”), 
which is impossible. Similarly, r Z +q”, sor = +2. l rs = 
+p", r + s = +q” (with appropriate sign) hence 2" = +p” + q”. By 


Chapter VI, (3B), this is impossible. 


The paper of Yahya (1973), where he published a proof of Fermat's 
last theorem, contained flaws of which one was the use of the incor- 
rect result of Kapferer. Yahya has also related Fermat’s equation to 
another diophantine equation. Inkeri examined this relationship and 
proved correctly the following statement: 


(1L)  Letn 23. The following statements are equivalent: 


(1) Fermat's last theorem is true for the exponent n. 
(2) The equation 


(1.8) 2x3 -Y?'X -Z"20 
has no solution in nonzero integers x,y,z such that gcd(z,y) = 
gcd(y, 22) = 1. 

A6. Frey 


We find in the thesis of Hellegouarch (1972) an elliptic curve asso- 
ciated to a hypothetical solution of Fermat’s equation with exponent 
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2p” (where n > 1 and p is a prime); the aim was to show the non- 
existence of points of certain orders in elliptic curves. 

In 1986, Frey had, independently, the same idea to associate to 
each solution of Fermat’s equation an elliptic curve. Explicitly, if a, b 
are nonzero relatively prime integers, n > 3 and a is even, the Frey 
curve is the curve with equation 


(1.9) Y? = X(X — a^)(X +b”). 


It is an elliptic curve, whose properties were studied by Frey. Ribet 
and Wiles studied this curve under the assumption that there exist 
a nonzero integer c such that a" + b^ = c". The use of advanced 
theories of elliptic curves, modular forms, Galois representations, 
led ultimately to a contradiction, thus showing that Fermat’s last 
theorem is true. We shall discuss this matter further in the Epilogue. 


B. Reformulations of Fermat's Last Theorem. 


The following statements, which are equivalent to Fermat's theo- 
rem, were given by Pérez-Cacho in 1946. The equivalence between 
(1), (2), (3), and (4) was first proved by Bendz in 1901 and was 
rediscovered by Krasner, who published his paper in 1939 (see also 
Rivoire (1968)), and by Chowla in 1978 (see also Inkeri (1984)). Lind 
indicated some partial results in 1909. 


(1M) Let m > 2, n = 2m — 1. The following statements are 
equivalent: 


(1) The equation X" +Y” = Z” has only the trivial solutions in 
integers. 

(2) The equation X(1-- X) — T" has only the trivial solutions 
in Q. 

(3) The equation X? = AY" +1 has only the trivial solutions in 
Q. 

(4) The equation X? = Y"*! — 4Y has only the trivial solutions 
in Q. 

(5) For every nonzero rational number a, the polynomial Z? — 
a" Z, +a is irreducible over Q. 

(6) The equation (XY )" = X +Y has only the trivial solutions 
in Q. 

(7) The equation X" = X/Y +Y has only the trivial solutions 
in Q. 


248 VIII. Reformulations, Consequences, and Criteria 


(8) If w,r are nonzero rational numbers, and if uj,u»,... is a 
geometric progression of ratio r, then u2, — ui +r #0. 

(9) If A is a triangle with vertices A, B,C, if the angle CAB = 
90°, if [AB] = 2, |AB| + |BC| is an nth power of a rational 
number then |AC| is not rational. 


Moreover, these conditions imply: 


(10) The tangents to the parabola Y? = AX at every rational point 
distinct from the origin, cut the curve Y = X™ at irrational 
points. 


PROOF. (1) — (2) Let a,b,c,d be nonzero integers, such that 
b > 0, d > 0, gcd(a, b) = gcd(c, d) = 1, and a/b(1 + a/b) = (c/d)”. 
Then a(a + b)/b? = c^/d^. Hence a(a + b) = c", b? = d". Since n is 
odd, then b = y" for some integer y. Since gcd(a,a + b) = 1, then 
a = x^", a + b = z”, for nonzero integers y, z. Thus x" + y" = z”. 

(2) — (3) Let x,y be nonzero rational numbers such that z? = 
Ay" +1. Then x Z +1 and ((z — 1)/2)((z — 1)/2 +1) = y^. 

(3) — (4). If x,y are nonzero rational numbers such that x? = 
y^*! — 4y, dividing by y"*! = y?", we deduce that 


2 n 
(=) -144 (=) | 
y™ y 

(4) — (5) The discriminant of Z? — a” Z + a is a^*! — 4a Æ 0 
(since n is odd). By hypothesis, a”*! — 4a cannot be a square, hence 
Z? — a" Z + a has no root in Q. 

(5) — (6) If z, y are nonzero rational numbers such that (zy) = 
x + y, let zy = a. Then Z? — a"Z + a has a solution in Q. 

(6) — (7) Let x,y be nonzero rational numbers such that x" = 
x/y +y. Let t = x/y so t+ y = (ty). 

(7) — (1) Let a,b,c be nonzero integers such that a” + b" = c^. 
Let x = c?/ab and y = cb"-! /a"^. Then 

x cat cmt c(a” +b”) E e 
= = -—-— 
y bom am amb ab 

(3) — (8) Assume that u2, = u,—r. Since um = ur”! then 
the equation r?^-) Z? — Z +r has the rational root u1, therefore its 
discriminant is a square, that is, 1 — 4r?"^! = si, Hence s and —r 
satisfy X? = 4Y” + 1. 

(8) — (3) The proof of the converse is similar. 
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If x,y are nonzero rational numbers such that 4y” + 1 = x? then 
the equation y2°"-) Z? — Z — y = 0 has a rational root w1. Let 
u; = uj(—y)^!; then u2, = u2y?n-9 = u; + (—y). 

(5) + (9) The condition (5) is equivalent to the fact that a?" — 4a 
is not a square in Q, that is, a?” — 4a” is not a square in Q, or still, 
(om IS = a?” — 4a" has no rational solution, for every rational 
number a Æ 0. Adding 4, this is equivalent to the non-existence of 
rational solutions for 


4+ (qa zy? = (a” — 2)’, 


which is in turn equivalent to (9). 

Now we show that: 

(7) — (10) Let (zi, ml Z (0,0) be a rational point of the parabola 
Y? = AX. The tangent to the parabola at this point has equation 


YY = 2(X + 21), 


that is, 
y? 
so 
2 
ene ares 
Yı 2 
The intersections of the tangent with the curve Y = X™ are the 
points (x,y) such that y = (2/yi)z + y1/2, y = x". Thus (2, y,/2) 
is a solution of the equation X" = X/Y +Y. By (7), x must be 
irrational. 


In 1958, Pérez-Cacho showed: 


(1N) Letn>2. There is a bijection between the following sets: 


(F) The set of solutions (x,y,z) of X" +Y” = Z”, where x, y, z 
are nonzero pairwise relatively prime natural numbers; and 

(F^) the set of solutions (u,v, w,t) of U"+V?" = W" FT?" where 
u,v, w,t are nonzero natural numbers, gcd(u, v) = ged(w,t) = 
gcd(v,t) 2 1, and w = vgcd(u, w), u = tgcd(u, w), t Z v. 


PROOF. Let (x,y,z) € F, let u = zz, v = y, w = yz, t = x. Then 
v Æ t, since x Æ y because 2x" is not an nth power. Also u” + v?" = 


250 VIII. Reformulations, Consequences, and Criteria 


creer pay?" = ar (arty) H y?" = (ar 4 y")y" +207" = yz” +07" = 
wr + t?n, 

We have u,v,w,t Æ 0, gcd(u, v) = gcd(w,t) = gcd(v,t) = 1, z = 
gcd(u, w) and u = tgcd(u, w), w = vgcd(u,w). Thus (u,v,w,t) € 
F". 

Conversely, let (u,v, w,t) € F’. Let x = t, y = v, z = gcd(u, w). 
Then w = vz = yz, u = tz = xz and x7" — y” = En — 449» = 
u” — w” = (x" — y")z^. Since t Z v then x” +y” = z^. 

Clearly the above correspondence between (F) and (F") is a bijec- 
tion. 


In 1979, Vranceanu indicated a less interesting property equivalent 
to Fermat’s last theorem for the exponent n. 

We give now an equivalent combinatorial (!) formulation of Fer- 
mat’s last theorem. It appeared in a short note by Quine (1989). 

Consider a set of n > 3 balls, which are to be arranged into z bins 
which are white, red, or blue. Let: 


W = number of white bins; 
B= number of blue bins; and 
R= number of red bins. 


Then z = W + B + R. Let: 


(r'b) = number of arrangements of the n balls into the bins, such 
that red bins receive no balls, but at least a blue bin has a 
ball. 

(rb') = same, but with no balls in blue bins and some ball in some 
red bin. 

(rb) = same, with at least a ball in some red bin and at least a ball 
in a blue bin. 

(w) = same, with all balls in the white bins. 


We have: 


(10) Fermat’s last theorem is true for the exponent n if and only 


if (w) # (rb). 


PROOF. We have 


Ee 


So 


= number of all arrangements of the n balls in the z bins. 


z” = (w) + (rb) + (rb) + (rb). 


VIIL1. Reformulation and Consequences of Fermat’s Last Theorem 251 


Let x = R+W and y = B+W. Then the number of all arrangements 
of the n balls into the bins which are red or white is 


x” = (w) + (rV). 


Similarly, the number of arrangements of the n balls which are blue 
or white is 


y = (w) + (r'0). 
If Fermat’s last theorem is true for n > 3 then z” Z x” + y" and, 


comparing, (rb) 4 (w). 
On the other hand, if Fermat’s last theorem is false for n > 3, let 
x,y,z be positive integers such that x” + y^ = z”. Let W, B, R be 


given by 
B=2z-2, 
R=z-y, 


W=x+y—z. 


By the above argument, (w) = (rb). 
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VIIL2. Criteria for Fermat's Last Theorem 


In this section we gather various results of a different nature, all 
proved with elementary methods. For the convenience of the reader, 
we classify them into subsections: 


A. Connection with Euler's totient function. 

B. Connection with the Möbius function. 

C. Proof that a nontrivial solution cannot be in arithmetical 
progression. 

D. Criterion with a Legendre symbol. 

E. Criterion with a discriminant. 

F. Connection with a cubic congruence. 

G. Criterion with a determinant. 

H. Connection with a binary quadratic form. 

I. The non-existence of algebraic identities yielding solutions of 
Fermat's equation. 

J. Criterion with second-order linear recurrences. 

K. Perturbation of one exponent. 

L. Divisibility condition for Pythagorean triples. 


A. Connection with Euler's Totient Function. The first result 
was proved by Pérez-Cacho in 1928, in a slightly weaker form: 
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(2A) Let x,y be nonzero relatively prime integers, let n > 2 be 
an integer, and let p be any prime factor of n,n = pm. Let z be 
any integer such that z > 3, z divides x" ty”, but z does not divide 


m 


zx" cy". Then p divides w(2). 


PROOF. If p = 2, it is true since z > 3, so (z) is even. 

Let p be an odd prime, n = pm, and suppose that netz), Then 
there exist integers r,s such that rp — sy(z) = 1. Since (z) is even, 
then r is odd. 

If we assume z?" = —y?™ (mod z) but zm # —y" (mod z), then 
grpm = —9y'?" (mod z) hence z(*)*0m = —ylsel)+)™ (mod 2). 
We have gcd(z, x) = gcd(z, y) = 1. Indeed, for example, if a prime 
q divides x and z, it divides x” + y", hence also y, contrary to 
the hypothesis. By Euler's theorem, z*(? = y*? = 1 (mod z), so 
x” = —y" (mod z), which contradicts the hypothesis. 

The proof is similar when z | x^ — y^, z | x^ — y". 


In particular, if gcd(z, y) = 1 and p | n and taking z = x" + y^, 
then p | y(x” + y"). As a matter of fact, as shown by Pérez-Cacho, 
this last assertion also holds when gcd(x,y) = d Z 1. Indeed, let 
x = dzi,y = dy, so gcd(xı, yı) = 1; then p | y(x? + yt); but 
z^ ty” = d'(x? + yf) hence p | y(x” + y”). 

As a corollary, we have the result proved by Swistak in 1969: 


(2B)  lfp is an odd prime and 0 < x < y < z are positive integers 
such that x? + y? = z? then p divides v(x), p(y), and p(z). 


PROOF. We may assume without loss of generality that x,y,z are 
pairwise relatively prime, because if, for example, x = dx, and p | 
im) then also p | y(x). 

We have 3 € z, z | z? + y? and z x + y; indeed, z? = x? + y? < 
(a+y)?,soz<a+y < 2z. By (2A), p | (2). 

Similarly, x | z? — y, x Yz — y since z — y < x. Also 


x? = (z— y) Le + pz? "ut B XE. 22) 


> 297p 2, 


so x > 3. By (2A), p | y(x). 
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Finally, y | z2 — x”, y Y z — x since z — x < y and 3 € x < y, so 
again p | p(y). 


Bussi indicated in 1943 the following corollary: 


(2C) Ifx,y,z are pairwise relatively prime positive integers, if p 
is a prime, p Y xyz, and x? + yP = 2”, then there exist primes q,r, s, 
such that q=r=s=1 (mod p) andq|z,r|y,s|z. 


PROOF. By (2B), p divides y(x), (y), p(z). Since p Y xyz then 
there exist primes q,r, s, such that pl q— 1, p|r—1, p| s— 1, and 
q|z,r|y. sz. 


This result is also a corollary of Chapter III, (1B). Another remark 
made by Bussi in 1932 is the following: 


(2D) Letn> 2, and let x,y, z be pairwise relatively prime positive 
integers such that x"+7" = z”. If k is any integer such that p(k) =n 
then gcd(k, xyz) > 1. 


PROOF. If gcd(k, xyz) 21 then 2?) =y?) = 7%) =1 (mod k). But 
p(k) =n, so 1 = z” = x" + y^ = 2 (mod k), which is absurd. 


B. Connection with the Möbius Function. Rameswar Rao 
proved in 1969: 


(2E)  Ifn >3 is an odd integer, and if x,y,z are positive integers 
such that x" + y” = z”, then the Mobius function u vanishes for 
Fu. 


PROOF. Since n is odd x+y divides x" +y” = z”. Any prime factor 
p of x + y divides z” hence divides z. 

If u(x 4- y) Æ 0 then x+y has no square factor, hence x+y divides 
z, in particular x+y € z. This is a contradiction since z < x+y. 


C. Proof that a Nontrivial Solution Cannot be in Arith- 
metical Progression. The next result is due to Bottari (1907); it 
was rediscovered by Goldziher (1913), by Mihaljinec (1952), and by 
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Rameswar Rao (1969). In 1908, Cattaneo gave the following simple 
proof, which is exactly the same as Rameswar Rao's: 


(2F) Ifn>2andz,y, z are positive integers such that x"+y" = z” 
then x,y,z cannot be in arithmetic progression. 


PROOF. Let us assume the contrary, so there exists a positive integer 
a such that z = y — a, z = y +a. Then 


(2.1) (y—a)"+y" = (y +a)". 


Dividing, if necessary, by the greatest common divisor of a, y, we 
may assume that gcd(a, y) = 1. We also see that y cannot be odd. 
The relation (2.1) implies that y” = am, for some integer m. Since 
gcd(a, y) = 1 then a = 1, so (y — 1)” + y^ = (y+ 1)”. 

We see that n cannot be odd, otherwise 


and since y is even then 2"^! would divide the odd number in brack- 
ets. 
Since n is even then dividing by y: 


[e e | 


so y"! = 2l. Hence y/2 divides l. Since y/2 divides each summand 
in the bracket but the last one, it follows also that y/2 divides (,",) = 
n, SO y < 2n. 

Hence y^-! > 2(7)y"^? > y"-!, a contradiction. 


D. Criterion with a Legendre Symbol. In 1958, Pérez-Cacho 
indicated the following criterion: 


(2G) Let p be an odd prime. Assume that if x, y, z are any nonzero 
relatively prime integers then there exists a prime q, q # p, q dividing 


z? — pry and 
p+1 __4 
(? d Ae 
q 


Then Fermat’s last theorem is true for the exponent p. 
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PRoor. We assume that there exist nonzero relatively prime integers 

x,y,z such that x? + y? = z”. By hypothesis, there exists a prime 

q, q £ p, such that q divides 2? — pry and ((p?*! — 4p)/q) = —1. 
From 27? = g? + 2z?y? + y?" it follows that 


2? P+D — Az? gPyP = z?(g?P — 2gPy? +y?) = [e(z — yP). 
Let k be defined by z? = pry + kq. Then 


(p^ — 4p)(zy)"*! = (2? — kq)? ™ — A(zy)?(2° — ka) 
m QD 4z? gPyP 


= [z(a? — y^)? (mod oi. 


Since p + 1 is even, then p?*! — 4p is a square modulo q, which is a 
contradiction. 


E. Criterion with a Discriminant. In 1949, Kapferer gave a cri- 
terion involving the discriminant of a certain polynomial. In the 
proof of this result we shall require some facts about the resultant 
and the discriminant of binary forms, which have been gathered in 
Chapter II, 84. 

'The following lemma was explicitly used by Kapferer; a proof is 
given in his paper. 


LEMMA 2.1. Let F(X,Y), G(X,Y) be forms of degree n, m, respec- 
tively, let LX, Y), M(X,Y) be forms of degree k, and let 


$(X,Y)— F(L(X,Y), M(X,Y)) 
(form. of degree kn), 

T(x, Y) = G(L(X,Y), M(X,Y)) 
(form of degree km). Then 

R(®,T) = [R(F,G)|" [R(L, MIT, 


PROOF. The result is trivial if n = 0 or m = 0, so we assume n > 
1, m 1. 

Let F(X,Y) = [Ia (a, X — o;Y) where oj, o/; are not both zero 
(for each à = 1,...,n). Thus ®(X,Y) = F(L,M) = IIa(o;L — 
aiM). Similarly, let G(X, Y) = IDA (8;X — 8, Y), so (X, Y) = 
T (BL — 8M). 
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Then by Chapter II, (4A), 


R(®,T) = [[[[ R(oíiz - aM, BL — 6;M). 
i=1 j=1 
For example, let 8; 4 0; then 
R(o;L = aiM, B;L = 8;M) 
1 / / 
SR zx Lan, — Booch, B;L = B;M) 


Bj 
1 


E 


CAL 
= (Bras ^ PD ou, B;L — 8;M) 


R (Bai — 0485) L. B;L — 8M) 


Í 
QR 
E 

| 
D 
Pe 
Ue 
E 
m 


Hence 


Now we give Kapferer's criterion: 


(2H) Let p be a prime number, p > 7. If there exist nonzero 
integers x,y,z such that 


p Vxyz(a — y)(y — z)(z — x) (X +y? + 27) 


and x? + y? + z? = 0 (mod p°), then p divides the discriminant of 
the homogeneous polynomial 


(ue (PT? _ A 1 SE 
K,(X,Y)= 2 KPa yt 
(XY) > | 9i Ja 
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when p = 1 (mod 6), 


(8) / HET GN. d = 
K,(X,Y) = 2 er na 
cass: | , Jai 


when p = —1 (mod 6). 
PROOF. Indeed, 
zer (modp) y=y (modp) ses (mod p), 


hence x + y+ z= 0 (mod p) and so z? = —(x +y)? (mod p°). Thus 
(x + y)? — z? — y? =0 (mod p°). 

We note that y —z = z--2y (mod p), z— x = —(2r4- y) (mod p), 
z^ y^ +2? = 2(z^- ty + y^) (mod p), hence p Y zy(x + y)(z — 
y)Qx + y)(2y + z)(z? + zy + y’). 

We have seen in Chapter VII, (2.2), that 


(X +Y) — X? — Y? = pXY (X +Y)(X? + XY + Y*YOQXY), 


where C (X,Y) € Z[X,Y] is the homogenized Cauchy polynomial 
and e = 1 or 2, according to whether p = —1, or 1 (mod 6). Thus p 
divides C, (x, y). 

We show that p divides the discriminant of C,(X,Y). Indeed, let 
Q(X,Y) = XY(X + Y)(X?4+ XY + Y?)5, so 


; (X + Y)" - x? nl = QU, Y)C,(X, Y). 


Hence taking the partial derivatives: 


(cvy - xe = qoo v) ocv) + Sac yo, ocv), 
(X+Y) t- Y”! = Q(X, yje (X,Y) + 4 29 (x. PCA VO 


OY oY 


Since p Y zy(x + y) then z?^! = y?! = (x y)! = 1 (mod p); 
on the other hand, p / Q(x,y). Therefore, since p divides C,(z, y) 
then p | (0C,/0.X)(x, y) and p | (0C,/0Y )(x, y). So, 


OC, OC, 


Discr(C,(X,Y)) - R E OG Y), OY 


——(X, Y) =0 (mod p). 
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Now we use the expression of C,(X,Y) as a homogeneous poly- 
nomial in 
L(X,Y) = (X? + XY + Y2ÿ = US, 
M(X,Y) = X2Y?(X +YP = V?, 
which was derived in Chapter VII, (2.5) and (2.6): C,(X,Y) = 
K,(L, M) where 


@-D/6 (PT? iN. 1 E? 
K,(L, M) = 2 SL OOM 
when p = 1 (mod 6), 
(äu (PT? iN 1 Mes 
K,(L, M) = 2 —— LEE yg 
when p = —1 (mod 6). We have 
0C, _ OK, OL OK, OM 
OX | OL OX OM Ox" 
oC, óK, OL OK, “OM 
OY OL OY OM OY’ 


where 

OL 2 2)2 

OM 

— = OXY AK LY Xy 

x (X & Y)GX +»), 

L 

Gel = 3(X? + XY + Y°) (X +2Y), 

OM 

— = BEER +Y)\(X +2Y). 

M (X +Y)(X +2) 
Letting L(x, y) =r, M(r, s) = s, then p /rs and from 

OC, _ OC, u 

ax (OY) = sy SY) =9 (mod p) 
it follows that either 


OK, _ 0K, 8 
(r,s) = Sin, lf (mod p) 
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or 
OL OM 
det on i pe — 0 (mod p). 
OL OM 


ay ev) ay 99) 
Computing the determinant, this condition becomes 
6(z? + zy + y?" (2x + y)(x + 2y)zy(x + y)(x — y) =0 (mod p). 


However, by hypothesis this determinant is not a multiple of p. 
Therefore 

OK, _ 0K, 
ðL (r,s) RM OM (r,s) 
that is, p divides the resultant of the binary forms (0K,/0L)(X,Y), 
(0K,0M)(X, Y), having degree m — 1, where 


=0 (mod p), 


Rug 

= 6 
me —5 
— when p=—1 (mod 6). 


when p=1 (mod 6), 


Since L(X,Y), M(X,Y) have degree 6, by Chapter V, (1A), 


aK, aK, 
lan, Sin 


= E (Sa. wn) ech wll ax Y), M(X,Y))-, 


R(L(X,Y), M(X,Y)) = R(X? + XY + Y2)5, X?Y?(X +Y)?) 
= [R(X? + XY + Y°, XY(X +Y))] - 1. 


Hence p divides the resultant 


R (SLM), a M)) = Discr(K,(L, M)). 


We note that we have actually shown that if r = (z?--xy--y?)? #0 
(mod p) and s = z?y?(x + y)? #0 (mod p) then (r,s) is a multiple 
root of the congruence K,(L, M) = 0 (mod p). 

From (2G) we obtain the following criterion: 
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(21) Let p be a prime number. If there exist nonzero relatively 
prime integers x,y,z such that x? + y? + z? = 0 then either 


p | zyz(z — y)(y — z)(z — x)(x° + y? + 2”) 


or p divides the discriminant of the polynomial 


ET) = F()- 5, SS 2i qu 


D 


where m — (p — 3)/2 and the summation extends from i — 0 to 


— 7 
— when p=1 (mod 6), 
— 

— 9 
Ue when p=-—1 (mod 6). 

PROOF. Suppose that x,y,z are nonzero relatively prime integers 
such that p /zyz(xz—y)(y—z)(z—x)(x?--y?4- z?) and a? +y?+z? = 0, 
hence p > 7. 

Then x? + y? + z? = 0 (mod p°), hence by (2G), p divides the 
discriminant of K,(X,Y), and more precisely, (r,s) with 


r = (xz? + zy +y’)? £0 (mod p), 
s = x°y"(x +y)? £0 (mod p), 


is a multiple root of the congruence K, (X,Y) = 0 (mod p). 
We write T = Y/X, so 


KAL Y/X)= F(T,  K,(X,Y) = X"F(T), 


where 
eX 
P. when p - 1 (mod 6), 
Sd when pz —1 (mod 6). 
But 
OK, 
= mX” F(T) — x"-—y r'(T 
Ke = mx" F(T) (n. 
OK, 


zx iU 
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and since (r,s) is a common root of the congruences 


OK, u 
72 (r,s) = 0 (mod p), 
OK, u 
OY (r,s) = 0 (mod p), 


letting t € Z be such that tr = s (mod p), then 


{ 0=mr™" F(t) — r"?sF'(t) (mod p), 
Q0 z r"-F'(t) (mod p), 


so F(t) = F'(t) = 0 (mod p). This implies that p divides the dis- 
criminant of F(T). 


F. Connection with a Cubic Congruence. In 1944, Pierre used 
a method somewhat similar to the one of Kapferer, to reduce Fer- 
mat’s equation to two systems of congruences. 

To begin, we need a lemma (part of these assertions were proved 
by Mirimanoff in 1907 and Skolem in 1937; see also Skolem (1941)). 


LEMMA 2.2. Let p = 6k +1 be a prime number, let a,b € Z, p y b, 
and consider the congruence 


X’ +aX+b=0 (mod p). 


Let t € Z be such that tb? = a? (mod p), and 


mk- (P — 5P = T) ies 
EE ere EE 


[p — (28 + D]lp (2s + 3)] --- [p (4s — 1)] 
226-0 (2s — 1)! 


+(—1)5 "(25 — 1) T*-s 


(1) If —(4t + 27) is not a quadratic residue modulo p, the con- 
gruence has a unique solution z, 0 « x « p. 

(2) If 4t + 27 = 0 (mod p), the congruence has two distinct so- 
lutions £1, £2, 0 < 21,23 < p. 

(3) If —(4t+ 27) is a quadratic residue modulo p and if V, (t) = 
0 (mod p) then the congruence has three distinct solutions 
T1, Lo, pa, 0 < 21,92, Ta < p. 
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(4) If —(4t + 27) is a quadratic residue modulo p, but V,(t) #0 
(mod p), then the congruence has no solution. 

(5) There are k values of t such that the congruence has several 
distinct solutions and 2k values of t for which the congruence 
has no solutions. 


In this respect, Cailler (1908) indicated a relation between the 
resolution of congruences of third degree and linear recurrences of 
second order; see also Mirimanoff (1909). 

Let F,(T) be the polynomial defined in (2H). With the above 
lemma, Pierre proved: 


(2J) Let x,y,z be relatively prime integers such that p Y xyz(x — 
y)(y — z)(z — x)(x? + y? + 2?) and x? + y? + 2? =0. Let r = (x° + 
zy t y), s = zy (zt yy. 
(1) If t satisfies the congruence rt = s (mod p?) then F,(t) = 0 
(mod p?). 
(2) If t satisfies the congruence r + ts = 0 (mod p) then 


(7) —1,  W,(t)=0 (mod p). 


PROOF. (1) By Chapter VI, (1C), x + y + z & 0 (mod p?). Hence 
x? pg? = —zP = (x +y)? (mod pt). Thus p* divides (x + y)? — 
x? — 4? = pzy(z + y)(z? + xy + y’) C(x, y) with e = 2 when p = 1 
(mod 6), e = 1 when p = —1 (mod 6). Since p Yz then p x + y. 

Similarly, since p /x?+y?+2? then p f z?--xy4-y?. Thus p? divides 
CL, y). With the previous notations, C,(r, y) = K,(r,s) = r"F,(t) 
(mod p?), with m = (p — 7)/6 when p = 1 (mod 6), m = (p — 5)/6 
when p = —1 (mod 6). Since p Kr then F,(t) = 0 (mod p). 

(2) We observe: the congruence 


(a? + ry +y’)? Ta^ y^ (x - y)! =0 (mod p) 
has a solution t if and only if 


0 (mod p), 


Pa? + tdx +t 
0 (mod p), 


dy + tdy +t 


with some d, not multiple of p. 
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Indeed, if dx, dy are solutions of X? + tX + t = 0 (mod p), then 
the other solution is congruent to —d(x + y) modulo p. Hence 


{ dry — x(x + y) — y(x +y)] =t (mod p), 
—dzy(x + y) = —t (mod p). 


Therefore 
d?(x? + zy + y?) =—t (mod p), 
{ d?ry(z + y) =t (mod p), 
so (x? + zy + y?)? + ta?y?(x + y)? = 0 (mod p). 
Conversely, let d be such that dry(x + y) —(x? + xy + y?) 
(mod p). Consider the congruence X? + aX + b = 0 (mod p) with 
roots dr, dy, —d(x + y). Then 


{ dry — x(x + y) — y(x + y)] =a (mod p), 
—dxy(x + y) = —b (mod p) 
Thus 
{ d(x? + xy + y?) = —a (mod p), 
dxy(x + y) = b (mod p) 


So b=a=t (mod p), hence 


d?r? + tdx +t =0 (mod p), 
dy? + tdy +t & 0 (mod p). 


It follows from the above considerations that the congruence X? 4- 
tX +t = 0 (mod p) has three incongruent solutions dr, du, and 
—d(x + y). Indeed, if, for example, dx = —d(x + y) (mod p) then 
x = —x(x + y) = z (mod p), contrary to the hypothesis. By the 
preceding lemma, —(4t 4- 27) is a quadratic residue modulo p and 
V, (t) 2 0 (mod p). 

'This concludes the proof. 


It is worthwhile to recall that if p = —1 (mod 6) then p Y z? + 
zy + y^, hence p Kai + y? + z? (see Chapter VI, Lemma 1.2). 

But Pollaczek has also shown in 1917 (see Chapter VI, (2J)) that 
even if p = 1 (mod 6), from z? + y? + z? = 0, p / xyz, it follows that 
p Va? + xy + uf, or equivalently, p Kai + y? + z?. 

Similarly, if æ = y (mod p) then zx? = y? (mod p°), z = —2x 
(mod pl, af = —2?x? (mod p°), so 2x? = 2?r" (mod p°). Thus if 
p Ÿ x then 2? = 2 (mod p°). Actually, Inkeri showed in 1946 that 
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2» = 2 (mod p?!) (see his paper). Hence, if p is such that 2? Æ 2 
(mod pt) then necessarily p Y (x — y)(y — z)(z — x). 

Wieferich showed in 1909 that if the first case of Fermat's last the- 
orem is assumed false for the exponent p then 2? = 2 (mod p°). This 
was the first of a series of criteria of similar type, discovered by Mi- 
rimanoff, Vandiver, Pollaczek, Rosser, and Granville. The proofs of 
these results are lengthy and nonelementary. For a fuller discussion, 
see my book 13 Lectures on Fermat’s Last Theorem. 


G. Criterion with a Determinant. In 1907, Bini used a classical 
recurrence relation to obtain an expression in terms of a determinant: 


(2K) Let x,y,z be any numbers, and let 


a=xr+yt+Z, 
b = ty + yz + 2x, 

C= XYZ, 
and for every n > 1 let Sn = x" + y" + 2”. Then: 
E Kä = aSn—1 + bS, 2 + CSn—3 = 0. 


2 
0 -a b —c 0 0 0 
0 1 -a b -—c 0 0 
0 0 1 -a b 0 0 
0 0 0 1 —a 0 0 
$,-d&| : : o; : 
0 0 0 0 0 b —c 
3c 0 0 0 0 —a b 
—2b 0 0 0 0 1 -a 
a 0 0 0 0 1 


PROOF. (1) z,y,z satisfy the equation 
X?’ — aX?’ +bX —c=0. 
Multiplying with X”"~*, we have 
X” — aX” 4 bX"? —cX"-3 — 0, 
Replacing X by z, y, z and adding the relations so obtained, we have 
Sn — On 4 + 5S, a — CSn—3 = 0. 
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(2) We write the above relations for k = 1,2,...,n. Thus 
$1,95,... , Sn satisfy a system of n linear equations: 


Sn x: QS, 1 + bSn-2 - CS, a SS 0, 


Sa = ass + bS2 = c9, = 0, 
Kë = as + bS, = 3c, 
$5 = as} = —2b, 
Si = €. 
The determinant of the coefficients of the system is equal to 1. Ap- 


plying Cramer’s rule, it follows that S,, is given by the determinant 
indicated. 


It should be pointed out here that Bini proved that this implies: 
if p is an odd prime and x + y + z = 0 then gyz divides x? + y? + 2’. 
But, as a matter of fact, this statement follows at once from the 
remark preceding (2A) of Chapter VII, noting that —z = x + y. 


H. Connection with a Binary Quadratic Form. In 1963, Pig- 
nataro linked Fermat’s equation with the representation of a pth 
power by means of a binary quadratic form. 

We first recall the following well-known fact. Fermat considered 
in 1657 the equation X? — dY? — 1, where d is a positive integer, 
but not a square (this equation has been erroneously named after 
Pell). Fermat stated that he had proved by the method of descent 
the existence of infinitely many solutions in integers. However, the 
first published proof is due to Lagrange, around 1766. Explicitly the 
result is the following: 


LEMMA 2.3. Let p be a positive integer, but not a square. There 
exists a solution in positive integers (1,1) for the equation X? — 
dY? = 1 such that: (x, y) is a solution in integers if and only if there 
exists an integer m such that x +yvd = (xı +y vd)”. In particular, 
for different values of m one obtains different solutions and therefore 
X? — dY? = 1 has infinitely many solutions in positive integers. 


The proof of Lagrange used continued fractions. Another proof 
may be found, for example, in Ribenboim (1999). 

We introduce the following notation. If b,c are nonzero integers, 
let (b,c) = bX? + cY?. 
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(2L) Let p be an odd prime, and let x be an odd positive integer 
which is minimal such that there exist positive integers y, z satisfying 
xP + yP = zP. Then: 


(1) yz is a quadratic residue modulo x; and 
(2) x? is represented by the quadratic form (z,—y) in infinitely 
many ways. 


PROOF. (1) We first note that since x? + y? zf and z is minimal 
then x,y,z are pairwise relatively prime. We write 


2 2 
= pp =z (ee) —y (gee) 


so (z, —y) represents z^. Also, z (2@-/2)? = (y@-0/2)? (mod x), 
hence yz (26-0/2) = y?*! (mod x), so yz is a quadratic residue 
modulo x. 

(2) Now we show that y, z cannot be both squares. Otherwise 
y = yi z = z2? with 71,2; > 0, so 


xP = xP — yi” = (2? — yh) (4 + yi). 


Since x is odd and gcd(y, z) = 1 then gcd (2? — y}, 2? + yf) = 1 and 
clearly 27 + y} > 1. Hence z? — y} is a pth power of an odd positive 
integer x’, 0 < 2’ < x, which contradicts the minimality of x. 

As seen above, x? = za? — yb? where a = z(?-9/2, b = y(»-07/2, 
Consider the equation X? — yzY? = 1. Since ged(y, z) = 1 and y,z 
are not both squares, then yz is not a square. 

By the above lemma, there exist positive integers u1, vı such that 
1 = uj — yzvi and moreover for every integer m if tm + vs /yz = 
(ur + v1,/yz)™ then 1 = uł, — yzv?,. We note that if 1 < m then 
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Um < 44 and Un < v,44. Then 


a? = (za? — yb’) (uz, — yzum) 


CEE 

eH) Gomme) om) (ere 
=z [oun + bo, y) — (Amz + GE 

[oun + bus) + (avg z + GE 


= z(au,, + bv y)? — y(aumz + bum)”. 


This shows that x? is represented in infinitely many ways by (z, —y). 


I. The Non-Existence of Algebraic Identities Yielding So- 
lutions of Fermat’s Equation. In 1895, Jonquiéres investigated 
whether there would be algebraic relations connecting hypothetical 
solutions of Fermat’s equation. 

If n = 2, we have the algebraic relation 


[Xe g^ (X? Vee = (2XY)? 


and, as was indicated in Chapter I, (1A), this yields all primitive 
solutions of the Pythagorean equation. 
We shall see that nothing of the kind exists if n > 2. 


LEMMA 2.4. Let F = Fo + Fi + Fo +- € Z[X,Y] where F; is the 
homogeneous part of degree i of F. 

If n > 1, there exist homogeneous polynomials of degree n, P> € 
Z|Xo,Xi], P € Z[Xo, Xi, X2],... (depending only on n) such that 
the homogeneous parts of F” are: 


(F")o = Lui, 


(F1 = n(F)" EF, 
LTL = n( Fo)" F, + Po(Fo, Fi), 
(F")3 = n(Fo) 1E + P3( Fo, Fi, F5), 


(F"), 2 no)" 1 F, + P(Fo, FA, ... , F1). 
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PROOF. By raising F to the nth power, we note that the homo- 
geneous part of (F"), consists of n(F)"-*F, plus a contribution 
involving the homogeneous parts of degree less than k. This is a 
polynomial expression, with multinomial coefficients, independent 
of F, depending only on n. 


(2M) If n > 2, there do not exist polynomials F,G € Z|X,Y] 
such that X"Y"^ = p" —G". 


PROOF. Let F = Fo + Fi + Fo +- , G = Go + Gi +G +: where 
F,,G% are the homogeneous parts of degree k of F, G, respectively. 
If F” = G” = XY then (F°) = (Ge for k = 0,1,... ,2n — 
1, (F")s, a (G")5, = XPY n By the lemma, Fo = Go, Es > Pn = 
Gon_1 and hence also 


NF)" Fon = (F")5, SCH P (Fo, F, Arr Ps) 
(G")5, + X"y" = Pon(Go: Gi, Sie Goya) 
= n(G,) 1 Gs, + X^Y^, 
) 


so n(Fo)" ^! (F5, — Gon) = X"Y". This implies that n = 1, which is 
a contradiction. 


We recall from Chapter VI, (3C), that ifn >2,f0<r<y<z 
are integers such that x" + y" = z”, then y is not a prime power. 
So y = ab, with a,b > 1, gcd(a, b) = 1. The above result tells that 
it is impossible to find polynomials F, Œ € Z|X,Y], such that for all 
a,b E€ Z a"b" = |F(a,b)|" — [G(a,b)]". 


J. Criterion with Second-Order Linear Recurrences. We re- 
phrase the results in Chapter III, (1A), (1B), (1C) in terms of second- 
order linear recurring sequences (see Kiss (1980)). We recall that if 
m > 1, if (Rx)x>0 is a sequence of integers, then r(m) is the smallest 
index r such that m divides R,. 


(2N) Let p be an odd prime, let x,y,z be relatively prime integers 
such that x? +y +2? =0. Let A = x—y, B = —-xy, Ro = 0, Ri = 1, 
and for every k > 2 let Ry = AR, 4 — BRy a and D = A? — 4B = 
(£ +y). 
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(1) If p yz then R, = dP, where d is an integer, R, = D@-V? = 
1 (mod p?) and d = 1 (mod p), r(d) = r(d?) =--- = r(d?) 
p (with the notation of the lemma). 

(2) If p | z then R,/p = d, where d is an integer, R, = p 
(mod pt), D = 0 (mod p*?7?) and r(d) = r(d?) = ++. = 
r(d?) = p, r(d"*!) # p. 


PROOF. (1) The roots of X? — AX + B are z, —y. By Chapter V, 
Lemma 2.1, Rp = (x? + y)/(z + y) = d (with d > 1), x +y = c, 
as follows from Chapter III, (1A), where c = t, d= t = 1 (mod p), 
by Chapter III, (1B), (with the notation previously used). Then 
R, = d? = 1 (mod p°). Since p / z and x + y + z = 0 (mod p) 
then p x + y so p fc. Hence D(»-)/2 = (y + y)e*1 = PY = 1 
(mod p°). 

Finally, since d? | R, then by Chapter V, Lemma 2.2, r(d), r(d?), 
«7 ,r(d?) divide p; since d > 1 then r(d’) 4 1, so r(d) = r(d?) = 
... = Tod?) = p. 

(2) Assuming that p | z, it follows as before from Chapter II, 
(1B) and (2C), that R, = (x? +y)/(x + y) = pd”, where d= t = 1 
(mod p°), z + y = p'e where n 22, c= t. Then R,/p = d? =1 
(mod p?) so R, = p (mod p^). Also, D = (x + y)? = 0 (mod p#?-?). 

Finally, since d? | E, but d?*! Y R, then r(d) = --- = r(d^) = 
p, r(d?*") 7 p. 


And now we give a result of Kiss and Phong (1979), containing 
an interpretation of the congruence q?^! = 1 (mod p?) in terms of 
an appropriate recurring sequence. 


(20) Let p,q be distinct primes such that p #4 2 and p f q — 1. 
Let A = q+ 1, B = q, Ro = 0, R, = 1 and every k > 2 let Ry = 
ARy_1 — DR Let D = A? — 4B = (q—1)?. Then the following 
statements are equivalent: 

(a) o "el (mod p°); 

(b) si, and 

(c) r(p) = r(p’). 
PROOF. (a) 5 (b) The roots of X? - AX+B=X?-—-(q+1)X +q 


are a = q, B = 1. By Chapter V, Lemma 2.1(1), Rp-1 = (g^! — 
1)/(q — 1). Since p /q—1 then p? | R,-1 if and only if p? | q?^! — 1. 
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(b) ^ (c) By Chapter V, Lemma 2.2(5), r(p) divides p—(D/p) = 
p— 1, since D is a square. Let p —1 = sr(p). By Chapter V, Lemma 
2.2(8), we have Rp1 = Ray) = SR, Rig (mod Rz). 

Since p | Run then p? | Ria also by Chapter V, Lemma 2.2(2), 
p | Rap p Y Rep 41 and p fs (since s € p — 1). Hence p? | Ry if 
and only if p? | Ry. Let v (R5) = k > 2. 

By Part (6) of Lemma 2.2 of Chapter V, r(p*) = r(p). But since p | 
Rr) then r(p) | r(p?). On the other hand, 2 € k, hence p? | Ra = 
Rp), hence r(p?) | r(p), showing the equality. And conversely, if 


r(p) = r(p?) then p? | Rt. 


K. Perturbation of One Exponent. The following result (see 

Schaumberger (1973), Klamkin (1974)) is rather a curiosity; it tells 

that if one *perturbs" even slightly one of the exponents in Fermat’s 

equation, the new equation has infinitely many solutions in integers. 
We first show: 


(2P) If a,b,c are integers, a,b,c > 1, gcd(ab,c) = 1, then the 
equation X° + Y^ = Z° has infinitely many solutions in integers. 


PROOF. It is trivial if c = 1. Let c #1. We note that there exist 
integers d,e such that abd + 1 = ce. We have d ¥ 0, otherwise 
ce = 1,c > 1, hence c = 1, which has been excluded. So, there 
exists an integer t such that d+tc > 1 and ab(d+tc)+1 = c(e+abt) 
with e+ abt > 1 since c > 1. Thus, there is no loss of generality to 
assume that d > 1, e > 1, and abd + 1 = ce. 

Let u > 1 be arbitrary and let 


x = bd, bc 
SCH , 
| y= gadac 
= , 
b 
z= 2u”. 
Then gt ty = gabd abc + 2abd, abc EET Qabd 1, abc Sg JQeeq abe zz zt. Since 


u is arbitrary, the given equation has infinitely many solutions in 
integers. 


It follows at once: 


(2Q) Ifn,k > 1 then X" +Y” = Z"*V* has infinitely many 
solutions in integers. 
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PROOF. First we note that (x,y,z) is a solution of X” +Y” = ZF"*1 
if and only if (x,y, z*) is a solution of X" + Y^ = Z"*V*, Taking 
a=n,b=n,c=kn+1 in (20), we see that the given equation has 
indeed infinitely many solutions in integers. 


L. Divisibility Condition for Pythagorean Triples. In 1913, 
Niewiadomski considered the polynomials in three indeterminates 


Do = Do(X,Y, Z) eL 
and if n > 1, 
D, = Da(X,Y, AE EE e 
He observed the identity, for n > 1, 
(2.2) Dayı — (X -Y)D, + XY Dh = Z"(Z - X)(Z - Y), 
which may be verified at once. 


With this identity, Niewiadomski and Métrod proved (1913): 


(2R) Let x,y,z be positive integers such that x? + y? = z?. Let 
d, = D,(r,y,z) for all n > 1. Then 2d, is divisible by d? when 
n 2. 


PROOF. We may assume without loss of generality that x,y,z are 
relatively prime. Indeed, let e = gcd(z, y, z), z' = x/e, y =y/e, 2 = 
z/e. So x? +y? = z? and we may assume that 2’ is even, while 
y', z' are odd. Let d = D,(2’,y’, 2’), so d, = e"d,, in particular 
d, = ed, and we note that d Z 0. Hence 2d,/d? = e^-?2d' /d!?. 
Since n > 2, it suffices to show that 2d’, /d/? is an integer. 

With the assumption that e — 1, by Chapter I, (1A), there exist 
integers a, b with 0 < b < a, gcd(a, b) = 1, such that 


x = 2ab, 
y= a? E b’, 
z = a? +b. 
If n = 2 then d; = 0 is divisible by d. Similarly by (2.2), 


d3 = (x + y)da — xydi + z(z — z)(z — y). 
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Hence noting that dı = 2b(b — a), we have 


2d; = —4ab(a? — Kid + (a^ + b’) (a — b)? 4b? 
[2a(a + b) + a? + El = [(a + b)? + 2a?]d?. 


Assuming that 2d,_; and 2d, are divisible by d?, it follows from 
(2.2) that 


2dn = (£ + y)2d, — zy-2d, + 22^! (z — z)(z — y) 


is also divisible by d? because 


22 (2 — z)(z — y) = (a? + Bb?) (a — MA = (a? + y". 
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IX 
Interludes 9 and 10 


We shall need the Gaussian periods and the Lagrange resolvents and 
Jacobi cyclotomic functions in the study of Fermat’s congruence. 


IX.1. The Gaussian Periods 


Let q be an odd prime, p a primitive gth root of 1, h a primitive 
root modulo q, L = Q(p), B = Ziq], and let 7 be the generator of 
the Galois group of L | Q defined by 7(p) = p^. 

Every element a € L may be indifferently written in a unique way 
as 


q—2 q—2 | 
E H = LA 
a= Qip or as Q = Qj P 
i=0 j=0 


(with a;,a;' € Q); moreover, a € B if and only if each a;, a,’ € Z. 
Comparing these two representations, and noting that 


RER ptt £ -(1+ p+: +p’), 


it follows that ao = —aq-1)/2 and a; = a,’ — a(, ,jj5 Where i = hj 
(mod q) (for i 2 1,...,9 — 2). 
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In the present situation, if q— 1 — fr, the r periods with f terms 
(relative to p and 7 or h) are: 
Ho = pt p" p pet p m, 
dp Sp aep ae 


hei = ph Hp gt DEET p. 


We have ey ju; = —1. For every j, we write uj = uj, if 0 < 
jo < r — 1 and j = jo (mod r). The periods u; are conjugate to 
each other: 7’(u;) = 54; (for à = 0,1,...,q — 2, and any j). In 
particular, 7^(u;) = p for j —0,1,...,r— 1. 

Let L’ denote the subfield of L which is fixed by 7”, so [L : L'] = 
f, [L : Q] = f; the Galois group of L | L’ is generated by 7? and the 
Galois group of L’ | Q is generated by the restriction 7’ of 7 to L’. 
Let B’ denote the ring of integers of L’. 


(1A) 
(1) (uo, pas... ,r-1} is a basis of the Z-module B. 
(2) [= Q(Ho, paeem , Ari); B= Zlue, se idea]: 
(3) (1, p, p2,... , p+} is a basis of the B'-module B. 
(4) The polynomial of periods 
r—1 
(1.2) FQ = [GC - 22 
i=0 


has coefficients in Z and it is irreducible. 


PROOF. (1) The elements pio, J, . .. , Uu are linearly independent 
over Z: if 35 4 aiu; = 0 (with a; € Z), replacing each u; by its 
expression, we have a linear combination of p, p°,... , p? which is 
equal to 0, and with coefficients 0. ao,@1,...,a,;_1 € Z so each 
a; = 0. 

On the other hand, if a € B' C B, we may write a = lo gp 
with a; € Z. Since 7'(a) = a then 


q—2 q-2 
hitr x hi 
d Qip = d Qip 
i=0 i=0 
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and from the uniqueness of the expression, we deduce that 


Gg = Ar = tt = A f-1)r5 
Qj = Gr41 — 777 = A f-1)r41; 
Qr Q2r—1 ere Qq-2- 


Hence a = 37-5 fly. 

(2) Clearly L’ D Q(uo,... , 14 1) and B’ D Z|uo, ... , p, 1]. The 
converse follows from (1.1). 

(3) Let G(X) = TI (X — p”) be the polynomial whose roots 
are the summands of the period po. Then each coefficient of G(X) 
is invariant by 7”, hence it belongs to BN L/ = B’. 

Thus G(X) = X/+a,X/~!+---+a, and since pis a root of G(X), 
then pf = —(aipf-1+---+a,). So pf is a linear combination of 
1,p,... , p! -! with coefficients in B’. Multiplying the above relation 
successively by p, p? ..., we deduce that o/*1, p/*?,... , p171 are also 
linear combinations of 1, p, p°, ... , ef! with coefficients in B’. Thus 
every element of B = Z{p] is a linear combination of 1,p,... with 
coefficients in B’. 

So (1, p,... , p/- 1 is a system of generators of the L/-vector space 
L. Since [L : I/] = f then (1,p,... , p/ ^!) are linearly independent 
over LI. hence over B’. 

(4) The coefficients of F(X) belong to Q, since they are invari- 
ant by T; hence they are in B'NQ =Z. 

Since F,,(uo) = 0, the minimal polynomial of uo divides F,,(X); 
its roots are all the conjugates of Lo, so it must coincide with F,,(X), 
which is therefore irreducible. 


It is not true in general that 


Zlue, -.- i-i] = Zo] = +++ = Zip]. 
For example, let q = 13, f = 3, r= and k = 2. The periods are: 


We shall show that the unique expressions of 441, u3, i3 as poly- 
nomials in uo with rational coefficients require some non-integral 
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coefficients. Indeed: 


Ho =H +2}, 
Hopi = Ho + Hi + pa, 
Hola = 3 + pi + Hs, 


and 


Du = Hoki + Züotta = 6 + Ho + 341 + Aus 
6 + Ho + 3(—1 — mo — pa), 


hence 
p2 = 3(—Ho — 2uo + 3). 

From this we obtain 

Hı = Ho — 22 = 5 (2M + 315 + Auto — 6), 

Jis = —1— Ho — ji — H2 = Aug — Au — Spo). 
It follows from (1A) that given 4,7, 0 € i,j € r — 1, there exist 
integers Hu € Z(0 € k € r — 1), which are unique such that pip; = 
Elo ur. More precisely: 


(1B) We have the relations 


r—1 


3 nina = req — f (for 0<k<r-1), 


i=0 
where 


1 when f is odd and k — 0 or r/2, 


1 when f is even and k = 0, 
Nk = 
0 otherwise. 


PROOF. First we evaluate the product 


Ix ir f=" k+jr 
Hobe = (se | Ei E 
1=0 j=0 


Writing j =i+1 (mod q — 1) then the above product is equal to 
f-lf-1 


ns KR KS ph artt) 


1=0 i=0 
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Let 
f-1 
S hl" (3.-h**?7) 
im 
1-0 
If 1+ h***" 40 (mod q), there exists a unique t, 0 < t < q— 2, such 
that 1 + bk" = hi (mod q); hence ui is equal to the period ju. If 
1+ AFF = 0 (mod q) then w, = f. Therefore, we may write 


(1.3) Hoy = nf + Mg oko + mgipa i: ms isis 


with integers n, > 0, mio > 0,... , my, 1 > 0. Now we determine 
Nk. 

(I) If f is even and k = 0, let f = 2f’, then 1 + Af" = 0 
(mod q) since fr = q— 1. So jjj, = f. On the other hand, if 
0 € i « f and u; = f, then we have 1 + h” = 0 (mod q), hence 
2ir =0 (mod q — 1), that is, 2ir = mr f; but mf = 2i < 2f, m=0 
or 1. If m = 0 then i = 0, an absurdity because q is odd. Thus 
m = 1,i = f/2. Therefore in this case n; = 1. 

(II) If f is odd (hence r is even) and k = r/2, let à = (f — 1)/2. 
Then 1+ /2* (470/27 = 0 (mod q), so Gu us = f. On the other 
hand, if 0 < i < f and m;' = f, we have 1 + h’/?+"" = 0 (mod q), 
hence r + 2ir = mr f; thus mf = 1 +2i < 1-- 2f; it follows that m 
is odd, som = 1 and i = (f — 1)/2. 

(IT) We consider the remaining cases. If 1 + A**^" = 0 (mod q) 
then 2k + 2ir = mr f and 0 € r(mf — 2i) = 2k < 2r, thus mf — 2i = 
0 or 1. 

If mf = 2i < 2f then m = Oor 1, and k = 0. If m = 0 then 
i = 0, k = 0, an absurdity, since q is odd. Thus m = 1 and f is even, 
which is a case already studied. 

If mf = 24 --1 « 2f +1 then m is odd, m < 2, som = 1, f 
is odd, à = (f — 1)/2 and also k = r/2, which was Case (II) above. 
Therefore, in Case (III), ny = 0. 

Since pop, is the sum of f? terms of the form o! and since each 
period contains f such terms, all appearing with different exponents 
i,0<i<q—1,it follows that 


Ne + mo + maya d misa f. 
Applying the automorphisms 7*, we obtain from (1.3): 


Wl = Nef + mio; + Mpg Hipi +--+ + Mk n-iHr-1: 
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Hence, from 377-4 Uu = —1 we conclude that 377-0 pipi, = ny(q — 
1) — (mx,o + m3 +: + Mari) = niq —- f. 


IX.2. Lagrange Resolvents and Jacobi Cyclotomic Function 


We shall use the following notations: 


e p,q are prime numbers such that q — 1 = 2kp; 
e Ç = primitive kth root of 1; 

e g = primitive root modulo p; 
e 
e 


K = Q(0); 
A = Zi]; 

e o = generator of the Galois group of K | Q, defined by a(¢) = 
£5 


e p — primitive qth root of 1 ; 
e L= Q(p), B = Zip]; 
e 7 — generator of the Galois group of L | Q, defined by 7(p) — 


h 
p5 

Ho,-.. ,p-1: the p periods with 2k terms (relative to p, T); 
L' = Q(uo,... , Mp1) = Q(uo) = ++» = Q(up-1); 


B' = Z(uo, .. . , i-i]; and 
T! = restriction of T to L’. 


Q(p, C) 
Q(p) = pe 
mad L'(c) 
5 nd Q(c) =K 


We note that DM K = Q. Indeed, the prime q is totally ramified 
in L and unramified in K, hence it is both totally ramified and 
unramified in LA K, so LA K =Q. 

Thus Q(p,¢) is a Galois extension of K with Galois group iso- 
morphic to the one of L | Q and generated by the automorphism 7, 
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defined by 


=. 


Similarly, Q(p,¢) is a Galois extension of L with Galois group 
isomorphic to the one of K | Q and generated by the automorphism 
©, defined by 


ER? 


e 


It is convenient to work with indices, as we define now. 

If t is any integer, not a multiple of q, then there exists a unique 
integer s, 0 € s € q — 2, such that t = h? (mod q). s is called 
the index of t (with respect to h,q), and we write s = ind; (t), or 
simply s — ind(t) if there is no ambiguity concerning the choice of 
the primitive root h. 

For example, ind(1) = 0, ind(—1) = (q — 1)/2. If t = t' (mod q) 
then ind(t) = ind(t’) and if t, are not multiples of q then ind(tt’) = 
ind(t) + ind(t’) (mod q— 1). It is also clear that every integer s, 0 < 
s < q— 2, is an index, namely s = ind(h?). 

We define the Lagrange resolvent (a, 3),, where a € K, BEL, 
and 7 is the automorphism defined above: 


(21) (a,8), = B+ ar(B) + o? 17 (B) ++ o? 7*7 (8). 


We write more simply (a, 8) in place of (a, 3),. The element (o, 5) 
belongs to the field Q(C, p). We note at once: 


(2A) For every n and 0 € Q(p): 


C'F(QG.,8) = (CP, 8) | and — (C,8»* € Q(C). 


PROOF. 


CMOS BY) = TBE Cap) RO EE Ce Mt 8) 
= (Z”, B), 


since C? = 1. So t ((¢”, B)?) = (F(C, B))? = CP- (C, B)? = (C7, B)’. 
Since (¢”, B) is invariant by 7 then (¢”, 8)? € Q(C). 
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We consider the resolvents (¢”, o"). With the index notation in- 
troduced above, we may write 


q—2 q—1 
(2.2) ue p. = Y (pu = Kä Hee, 
u=0 t=1 


The complex conjugate of (C°, p"), is (OC, o"). = (C",p ").. A 
first result to record is the following: 


(2B) With the above notations, for every n = 1,2,...,p—1: 


(C pe = (CT, Mo) 7 = 0 


and it belongs to L. 
PROOF. 


(2.3) (C, P) Sep e sae ee éi 
= p+ Co ++ qe s 
d po ze p Jore (acm 


p2r-1 


n QD» 
TP 


d ye "E EE (o7 Dn ` 
= bo + Ca + FOR 
(CT, Mo) +! € Le 


Moreover, JC". Ho) # 0. Indeed, the periods Ho, Dn... , pi, 
which are a basis of L’ | Q, are still a basis of L'(C) | Q since this 
extension has still degree p. 


The theory of Gaussian periods, Lagrange resolvents and more 
general sums of roots of unity is very rich and important. We shall 
only indicate the results which will be needed in the sequel. 


(2C) fp fn, qm then (C^, p) = (C^, p) C "09, In particular, 
(Cp = (C, pe. 
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PROOF. ((^, p") = 5271 çrind(@)5mt, But ind(tm) = ind(t)+ind(m) 
(mod q — 1), hence 


q—1 
Cea = Ce) »» Cuan 


t=1 


q—1 
= Cee »» Coats) ge 


s=1 


SR C", p). 


(2D) [fp yn then (C", p)(C "^, p) = 


PROOF. 
q—1 GER . 
| CG ind(t w) E Ee uu 
t=1 s=1 


q—1q-1 
grind t)— ing(s Artes 


(ÉD up) 


s=1 t=1 


For each s let r be defined by the congruence t = rs (mod q). Since 
p|q— 1 then ¢4~' = 1, so the above sum is equal to 


—14-1 


Q 


b» Grindir) gere. 
s=1 r=1 
But q — 1 = 2kp so 
dl " p 
5 Ce (r) _ 35 a 2k »» C 
r=1 m=1 m=} 
hence we may add X42] (^ ^4) = 0 and write 
q—1q-—1 
n —n nind(n) (r 
(CPG uec. a RES 
s—0 r=1 
q—1 q— 
— grma) p d . 
r=1 s=0 


But 


x (41). _ J q whenr=q—l, 
24 ~ | 0 whenlzrzq-2, 


286 IX. Interludes 9 and 10 


hence (£^, g) (£7, p) = qCnind6-0 = grid 2 gene? 


(2E) Or, p)| =q when p Jm. 
PROOF. The absolute value of Am = ICT, p) is 
LE Aen Ann = (p Up) 
= (6, p) (C7, pp id 2 gore? 
= qe" =q, 
using (2D), (2C). 


= OCT 


Now we give an expression of the Gaussian periods in terms of the 


Lagrange resolvents: 


(2F) Ifp/n then 


OI 


Pa iny (p 
PRoor. We compute the right-hand side: 


Den = bere 


j=0 j=0 u=0 
—1 g—2 
= Y y oer n) gh 
j=0 u=0 
—1 q—2-n 
yr 
j=0 t=-n 
q—2—n p—1 
n E 
t——n Jeg 


But ae G = p when p | t, and equal to 0 otherwise. Thus the 


above sum is equal to 


q—2—n 
tin 
1 > DN | =m 
i=—n, pit 


X 
The Local and Modular 


Fermat Problem 


In this chapter we investigate some natural modifications of the origi- 
nal Fermat problem. In the first section, we search solutions in q-adic 
integers. In the following section we consider Fermat’s congruence. 


X.1. The Local Fermat Problem 


Our aim is to show that for every prime q, Fermat’s equation has so- 
lutions in nonzero q-adic integers. Our tool will be Hensel’s Lemma. 


(LA) For every prime q and every prime p, the equation X?--Y? = 
Z? has nontrivial solution in q-adic integers. 


PROOF. If p = 2, this is trivial, by Chapter I, (1A). So we may 
assume p # 2 and for convenience, we consider the equation XP + 
Y? + Z? =Q. 


First Case: q £ p. 

Let F(X) 2 XP + œ — 1; then X? +g -1 = XP-1=(X 
1)(X?7! -- X?7? ------ X +1) (mod q). Since 1 mod q is not a root 
of X?  X?^? +... + X +1 modulo q, by Hensel’s Lemma (see 
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Chapter V, (1T) and (1U)), there exists a g-adic integer o such that 
a = 1 (mod q) and o? + p! + (-1)? = 


Second Case: q = p. 

Let F(X) = XP + p — 1 and Go(X) = X — 1, Ho(X) = X^! + 
X? E... X1. By Chapter II, (4B), the resultant of Go and Ho is 
equal to R = Ho(1) = p, so v,(R) = 1. Since Go(X)Ho(X) = X? —1 
then F(X) = Go(X) Ho(.X) (mod p). But p > 3 > 2u(R), so we 
may apply Hensel's Lemma of Chapter V, (1T) and (1U). Thus there 
exist monic polynomials G(X), H(X) in Z,[X] such that G(X) = 
Go(X) (mod p’-'), H(X) = Ho(X) (mod p?!) and F(X) = G(X) 
xH(X). So G(X) = X — a where a is a p-adic integer, a = 1 
(mod p?-1), soa Æ 0 and F(a) = 0, that is, a? +p? +(—1)? = 


In the proof we obtained solutions in which one of the numbers 
was not a unit in the q-adic field. So it is natural to ask if there is 
always a solution in q-adic units. In the sequel we give results which 
may be found in Klósgen's paper (1970). 


(1B) Letn > 1 and let p be an odd prime. The following conditions 
are equivalent: 
(a) There exist integers x,y,z, not multiples of p, such that x?” + 
yP +2" =0 (mod p^*?). 
(b) For every m > 0 there exist integers £m, Ym, Zm, not multi- 
ples of p, such that zt, + y^, + z^, =0 (mod p"*!*™) and 
Tm41 = Im (mod p Ym+1 = Um (mod pe 
(mod p"*!). 


, Zm+1 = Zm 


PROOF. It suffices to prove that (a) implies (b) and we proceed by 
induction on m. From z?, + y^, + 2" = 0 (mod p'tit"), with 
integers £m, Ym, Zm not multiples of p, we may write x?” + y?” + 
27 = p'pttitm. with r' € Z. Since p |} zm, there exists an integer 
r such that rz? -! = r’ (mod p). Let z441 = £m, Ym+1 = Ym, and 
Zm4l7 Zm — rp". Then 


2 


E E gb. pagi oem) SE 


n 
p” DE We p"—1 n--1-4-m p p" —2,,2 ,,2(1--m) 
Zm+1 = Zm — 2m Tp + | js r p 
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n+l+m ere) 
D 


I p^ "E p'—1 
= zb o—zP rp (mod p 


because m > 0, p Z 2, so p"*?*" divides all summands but the first 
two. Hence 


n n n+1 n n n n 
p p p — y? p p p^ —1, , n+l+m 
m4 F Hait ch Zm+1 = Im F Ym + Zm — Zm rp 
= (r — pue 
= 0 (mod pm. 


We note that we may in fact take £m = £o, Ym = Yo for every 
m > 0. As a complement, we note the analogous result for p = 2: 


(1C) Letn>1. The following conditions are equivalent: 
(a) There exist odd integers x,y,z such that 


z^ py" LiT m0 (mod 2*7?) 


(b) For every integer m > 1 there exist odd integers £m, Ym, 2m 
such that 


cl +y +22 =0 (mod 2”+?+) 


and Zut) = Lm (mod 2™*1), y,,,, = Ym (mod 2+1), zat 
= Zm (mod 2#1), 


PROOF. The proof is quite similar. It suffices to note that if zm+1 = 
Zm — r2™+1 then 


7 ^ ^ 25 
2 en OMA 2"—] n4-14-m 2" —2,,292(14-m) 
Sat E A T2 de Z6 Ped 


= (5) 22 72s 0m S Gate 


Di rane 


2n 21 n+lt+m 
EE 7^ (mod 


E 


because 2^-1*2(0-) divides all but the first two summands and n + 
2-4 m € n — 1-4 2(1-- m) since m > 1. 


Concerning the solutions of Fermat's equation by p-adic units, we 
have: 
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(1D) ` Let p be an odd prime. The following conditions are equiv- 
alent: 


(a) There exist units a, B, y of Lp such that o? + BP +7? — 0. 

(b) There exist integers £o, Yo, zo, not multiples of p, such that 
x + yo + 20 =0 (mod p’). 

(c) For every n > 0 there exist integers Ln, Yn; Zn; not multiples 
of p, such that x? + y? + z? = 0 (mod p"*?) and fait = Ln 
(mod Ch Yn+1 = Un (mod Gers Zn41 = Zn (mod pss 


PROOF. (a) = (b) We write a = zo--o/p, B = yo+ Bp, Y = 20+ 
where £o, Yo, zo are integers, 0 < zo, yo, zo € p— 1, anda’, 8', Yy € Das 
Since a, D. a are units, then p / xoyozo. From a? + 8? + «y? = 0 it 
follows that xb + yj + 26 = 0 (mod p°). 

(b) = (c) This was proved in (1B). 

(c) = (a) The sequences of integers (£n)n>0, (Yn)n>0; (Zn)n>0 are 
p-adically convergent, since r,,; = £n (mod p”*!), Mun = Yn 
(mod p"*!), z,,, = z, (mod p"*!) for every n > 0. Let a = lim zn, 
B = lim yn, y = lim z,. Since x? + y? + z? = 0 (mod p"*?) then, at 
the limit, o? + 8? 4- y? — 0. 


By the above proof, the conditions of (1D) are equivalent to: 


(a’) There exist integers x,y, not multiples of p, and a unit y € 
Zp, such that x? + yP + 4? = 0. 


Similarly, we have: 


(1E) Let q,p be distinct primes. The following conditions are 
equivalent: 


(a) There exist units a, B, y of 2. such that a? + BP +7? — 0. 

(b) There exist integers to, Yo, zo, not multiples of q, such that 
x + yo + 26 =0 (mod q). 

(c) For every n > 0 there exist integers £n, Yn, Zn; not multiples 
of q, such that x? + y? + z? =0 (mod q"*"), and an41 = x, 
(mod gu Yn+1 = Un (mod qt), Zn+1 = £n (mod qu 


PROOF. (a) > (b) We write a = zo--o'q, B = yo+ b'q, Y = zo4-"Y q, 
where zo, yo, Zo are integers, 0 € zo, yo, zo < q— 1, anda’, Olai € La. 
Since a, B, y are units then q Ÿ zoyozo. From o? 4-8? 4-4? = 0 it follows 
that x} + y5 + 25 =0 (mod q). 
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(b) = (c) We prove the statement by induction on n. It is true 
for n = 0 and we assume it true for some m > 0. Let £m, Ym, Zm be 
integers, not multiples of q, such that x? +y? +22, =0 (mod g"*1). 
Hence z?, + y?, + z?, = r’q™* where r’ is an integer. Since q | zm, 
there is an integer r satisfying the congruence rpz?. ! = r' (mod q). 
Let £m+1 = Zm, dud = Ym and Zm41 = Zm Sts Then z2] = 
(zm — rg" *1)? = zf, — pz? !rq'"*! (mod q+?) and 224, + y + 
(ucc oe dom pee gett ea EE gs e H 
(mod q’”*?). 

(c) = (a) The sequences of integers (%n)n>0, (Un)n»o: (Zn)n>0 
are q-adically convergent, since z,,; = rz, (mod q"*!), un = Yn 
(mod q"*3),2,,, = Zn (mod g"*1) for every n > 0. Let a = lim zn, 
B = lim yn, y = lim z,. Since x? + y? + 22 =0 (mod p"*') then, at 
the limit, o? + 8? 4- y? — 0. 


As in (1D), the conditions of (1E) are also equivalent to: 


(a") There exist integers x, y, not multiples of q, and a unit + € Le 
such that z? + yP +% = 0. 

We conclude by noting that Fermat equations (for n > 3) provide 
an interesting example where there is a nontrivial solution in every 
g-adic field (by (1A)) and only the trivial solution in integers, as 
proved by Wiles. 
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X.2. Fermat Congruence 


We shall study the congruences 
(2.1) X"™+Y"+4+ 2" 
(2.2) X” +Y” 


0 (mod q), 
Z” (mod q), 


where q is an odd prime number, n > 0, and q does not divide n. 
Let N(n,q) = Au.) | lo ww < q, £” +y” +2” 


0 
(mod q)} and N'(n,g) = #{(x,y,2) |1<x,y,z < q, tu = 2” 


2 


292 X. The Local and Modular Fermat Problem 


(mod q)). Clearly, if n is odd then N(n,q) — N'(n,q). In this 
connection, we consider the following problems: 


(1) To determine when N(n, al, N’(n,q) are greater than 0. 
(2) To find upper and lower bounds for N (n, q), N'(n, q). 
(3) If possible, to calculate the values of N (n, q), N'(n, q). 


We recall from Chapter IV, (2A), that if gcd(n, q—1) = 1 (hence n 
is odd) then N(n,q) = N'(n,q) > 0. We have also shown in Chapter 
IV, (2D), that if p and q = 6kp + 1 are primes then N(p, q) > 0. 

The following implication was proved by Libri (1832, p. 275) and 
again by Pepin (1880), Pellet (1887), and Matthews (1895). 


(2A) Let p be a prime number. If there exist infinitely many 
primes q such that N(p,q) — 0 then Fermat's last theorem is true 
for the exponent p. 


PROOF. Assume that there exist nonzero integers x,y,z such that 
x? +y? +z? = 0. If ois any prime number such that q > max{|x|, |y], 
|z|} then x? + y? + 2? =0 (mod q) and N(p,q) > 0. So N(p,q) 20 
for only finitely many primes q, proving the statement. 


This result shifts the proof of Fermat's last theorem to the proof 
that N(p,q) — 0 for infinitely many moduli q. The fact is that we 
shall actually prove the opposite, namely for every p there exists a 
prime qo(p) such that if q > qo(p) then N(p,q) > 0. 

Before proving this theorem, we describe some of the numerous 
special results concerning these congruences. 

Legendre (1830) showed that N(3, 7) = N(3,13) = 0 and N(5,q) 
= 0 for q = 11,41, 71,101. 

Libri wrote a series of papers (1824, 1832) in which he exposed a 
method to compute the number of solutions of very general congru- 
ences. Libri calculated N(3,q) for many primes q = 1 (mod 3) and 
showed that there exists qo(3) such that if q > qo(3) then N(3, q) > 0; 
it should be noted that some of his calculated values were incorrect. 
These results were published again by Pepin in 1880 (see also his 
paper of 1876). 

Since q = 1 (mod 3) then —3 is not a square modulo q and there 
exist integers l,m of the same parity such that 4q = l? + 3m?. This 
may be explained by considering the decomposition of q as a product 
of elements in the field Q(4/—3). From the fact that —3 is not a 
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square modulo q then q = aa’ where a = (1+ m/—3)/2, « = 
(l — m4/—3)/2 with l,m integers, both even, or both odd, so 4q = 
P + 3m?. We choose a representation with minimal |||. We note 
that | is not a multiple of 3; by changing l into —1 (if necessary) we 
may assume that l = 1 (mod 3) and this implies that | is uniquely 
defined with the above property. Pepin showed: 


(2.3) N(3,q) = (a — 1)(a— 84 1). 


Here are some numerical examples: 


28 =1+3x3? = N(3,7) 26(7 — 84-1) 20; 

52 =524+3x32 = N(3,13) = 12(13 — 8 — 5) = 0; 

76 =1743x5? = N(3 19) = 18(19 — 8+ 1) = 216; 

12 4=4?+3x6? = N(3,31) = 30(31 — 8 + 4) = 810. 
N( ( 


Pepin noted that since | > —2,/q then N(3,q) > ,/q(,/a — 2) — 
Hence if q > 19 then N(3,q) > 0. 

Libri established that for every prime p there exists qo(p) such 
that if q > qo(p) then N(p,q) > 0; however, he gave no bounds for 
N(p,q) or a way of computing qo(p). Pellet used another method to 
show the same result in 1887; in a later note (1911), Pellet gave a 
bound for N(p,q), but his value was erroneous. 

In 1837, Lebesgue considered arbitrary polynomial congruences 


(2.4) F(X,,..., Xn) =0 (mod q), 
where s > 2, q is an odd prime, and F € Z[X,,... , X;]. Let 


Ned m6 vx [LE ere gl 
for all à = 1,...,s, such that F(zi,... ,2,) 2 0 (mod q)}, 

No = FE Gees ay) | 0 xm <q-l 
for all à 2 1,...,s, such that F(x:,...,x,) 2 0 (mod q)}. 
In the polynomial Fu"! let A (respectively, Ag) be the sum of the 
coefficients of all monomials eX1' --- X$» such that q— 1 divides each 


e; (respectively, each e; is greater than 0 and divisible by q — 1). 
Lebesgue showed that N, No satisfy the congruences 


No = (-1)**" Ao (mod q), 
N = (—1)'(1— A) (mod q). 
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Lebesgue applied his method to congruences like 
(2.5) AX? +--+ AX? =0 (mod q), 


with n > 2, s > 2, each A; is a nonzero integer, g is an odd prime, 
q = 1 (mod n). He gave an expression for the number of solutions 
N in terms of the periods of the cyclotomic equation. Lebesgue 
studied in detail the following special cases of the congruence (2.5): 
s = 2, n = 2; s = 3, n = 3; s = 3,n = 4. For (2.1) with n = 3, he 
derived once more some of the results of Libri. 

Further results about the number of solutions of (2.5) appeared in 
Lebesgue’s paper of 1838. In 1909 two papers by Dickson appeared, 
and one each by Cornacchia and Hurwitz, dealing with these con- 
gruences. 

Cornacchia studied various special cases of (2.1) and (2.5) in detail 
and gave many explicit results, which had been in part previously 
indicated by Lebesgue, Pepin, and Pellet: 


(a) If n 2 2 and q = 1 (mod 4) then 


9 
—— when 2 is a square modulo q, 
N'(2, q) = 5 
—— otherwise. 


7 
—— when 2 is a square modulo q, 
Nep=t À, 
—— otherwise. 


(b) For n = 3, q = 1 (mod 3), Cornacchia obtained once more 
Pepin’s result and showed that if N(3,q) = 0 then q = 7,13. 

(c) If n = 4 and q = 1 (mod 4), Cornacchia calculated N(4, q). 
Moreover, he showed that N'(4,q) — 0 exactly when q — 
11,17,29, 41. 

(d) If n = 6 and q = 1 (mod 6) then N’(6,q) was also deter- 
mined. Moreover, N'(6, ol = 0 exactly when q = 7,13, 19, 43, 
61,97, 157,277. On the other hand, for the congruence X + 
Y9 + Zê = 0 (mod q) we have N(6,q) = 0 exactly when 
a = 7,13, 31,61, 67, 79, 97, 139, 157, 223, 277. 
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(e) If n = 8 and q = 1 (mod 8) then Cornacchia gave upper and 
lower estimates for N'(8,q). Moreover, N'(8,q) — 0 exactly 
when q = 17,41,113. And for the congruence X? + Y? + 
Z8 = 0 (mod q), he established that N(8,q) = 0 exactly 
when q = 17, 41, 113, 137, 233, 761. 


In his papers, Dickson dealt with the congruence (2.1). His method, 
involving the periods of cyclotomic equations, led to lower and upper 
bounds for N(p,q), as well as an upper bound for qo(p), namely 


qo(p) € (p — 1)*(p — 2)? + 6p — 2. 


As applications, Dickson showed that N(5,q) = 0 exactly when q = 
11,41, 71, 101 and that N(7,q) = 0 exactly when q = 29, 71, 113, 491. 
Using computations of Carey (1893) for squares and products of 
periods, Dickson applied his method to the congruence X*+Y+* = Z? 
(mod q). 

Hurwitz's paper dealt with the more general congruence (2.5) with 
n = p a prime. He considered the family of such congruences. For all 
possible values of the coefficients and indicated relations which must 
be satisfied by the numbers of solutions of these various congruences. 
From this information, Hurwitz deduced upper and lower bounds for 
the number of solutions of 


(2.6) AX? + BY? + CZ? =0 (mod q), 


where A, B, C are nonzero integers. He also determined a positive 
number qo(p) (depending on equation (2.6)) such that if q > qo(p) 
then (2.6) has a solution (x,y,z), with 1 < z,y,z € q-— 1. 

In 1917, Schur gave a proof that for every n > 2, if q > (n!l)e +1 
then X" + Y" + Z” = 0 (mod q) has a solution (x,y,z) with 1 < 
1,9,z € q— 1. Schur's proof was based on the following interesting 
combinatorial lemma: 


LEMMA 2.1. Let n > 1 and N > (n!)e +1. If the set of numbers 
(1,2,... , N) is partitioned into n disjoint subsets L,,..., Ln, there 
exists at least one subset L; such that if m,m € L;i with m < m 
then mi — m € Lj. 


Among further developments, we want to report that the equa- 
tions 


AX? + BY! CZ? 20 
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and 
AIX + Ag X53? +... + AXES =0, 


with coefficients in a finite field with g^ elements (d > 1) and not 
necessarily equal exponents, have been the object of numerous pa- 
pers. See Mitchell (1917), Vandiver (1944, 1945, 1946, 1947, 1948, 
1949, 1954, 1955, 1956, 1959), Hua and Vandiver (1948, 1949), E. 
Lehmer and Vandiver (1957). For the vast, more recent, literature, 
the reader should consult the appropriate sections in Mathematical 
Reviews. 

In his paper of 1949, Weil traveled a historical panorama of the 
evolution of Gauss’ original method to deal, by means of Gaussian 
sums, with the congruence AX? — BY? = 1 (mod q), where q is a 
prime, q = 1 (mod 3). These ideas were applied subsequently to 
wide classes of congruences. They were used by Hardy and Little- 
wood in connection with Waring’s problem; Hasse expressed rela- 
tions between Riemann’s hypothesis for function fields and various 
kinds of exponential sums, and Weil published striking and definitive 
results on this question (1928). 

Here we single out the following specific result (see Vandiver, 1946, 
pp. 47-52; Hua and Vandiver, 1948, pp. 258-263), which is more 
directly related with our subject matter. 

Let s > 1, let Aj,...,A, be nonzero integers, let n1,... ,n, be 
integers, let q be an odd prime, and let d; = gcd(q — 1,|n;|) > 1 
for i = 1,...,s. Let N denote the number of solutions in integers 
(z1,... ,2,) with 1 < x; € q—1 (fori = 1,...,s), of the congruence 
(2.7) Au En +++: + AX; =0 (mod q). 

Then 
(gr 
q 
In particular, there exists a positive number qo such that if q > qo 
then N > 0. Another proof of this theorem, using the theory of 
group characters, was given by Feit (1967). 

After this survey of results which are related to Fermat’s congru- 

ence, we shall give the proof of Dickson's theorem. For this purpose, 


we recapitulate the following notation and facts from Chapter IX. 
Let p > 3, q = 2kp + 1 be prime numbers. Let 


(qe TL 


ea < N < p dicc dig. 


g be a primitive root modulo p; 


X.2. Fermat Congruence 297 


¢ be a primitive pth root of 1; 
h be a primitive root modulo q; and 
p be a primitive qth root of 1. 


The p periods of 2k terms in Q(p) are 


mo = pt p + p He + gt tm 
m M p^ + p + ch p+1 Met pi AE 
(2.8) het 


TI ML DI E ps 
In Chapter IX, we defined Lagrange resolvents for every 7 = 0,1,..., 
p—l 


q—1 
(2.9) e) inda (0 5 


t=1 


where ind; (t) = s, 0 € s < q—2 when t = h? (mod q). In particular, 
No (1,0) = Lip = 1. 

For easy reference, we recall the following results from Chapter IX, 
(2D), (2E), (2F): For j =1,...,p—1, 
(2.10) Ah = 4, 

(2.11) IA; = AA = q 
(À; denotes the complex-conjugate of A;), 


OIS 


(2.12) 232 Hd, 


We shall give a simplified proof of Dickson’s theorem, following 
Klósgen (1970). First, we give an expression of N(p,q) in terms 
of the periods ue 


(2B) 


TEEN 
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PROOF. To begin, we note that if x,y,z are integers such that 1 < 
x,y,z < q— 1 then 


= picerty te?) — 0 when z? + y? + z? £0 (mod q), 
ud q when x? + y? + z? z 0 (mod q). 
So 
q—1 q—1 
qN(p.q) = prse 
æ,y,2=1 \t=0 
q— q-1 


1 
— tx? ty? tz? 
= | p^ p" p | 
t=0 \x,y,z=1 


Ift=h' (mod q) and x = h? (mod q) (where 0 € i, j < q— 2) then 


313p" = 51-5 ph” = pui. Since uj = pi; when j = i (mod p), 
then 
p-1 


qN(p,q) = (qa 1? - 2k S pur, 


i=0 


because q = 2kp + 1. Therefore 


1 
N(p.q) = : (q — 1? + (a — Dp? E 


And now, we prove Dickson's Theorem: 


(2C) We have: 


(1) (@-V[at+1—-3p—- (p- 1)(p - 2)/a] < N(p, 9) 
< (q— T)[a +1- 3p t (p — 1)(p — 2)/q]. 

(2) If q > (p—1)?*(p— 2)? + 6p — 2 then the congruence (2.1) has 
a nontrivial solution. 
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PROOF. (1) By (2B) and (2.11), 


Np) =i ja-a- or ia e 


1 exp err ZE zo in 
= (q D 3 ili tj2+js Ad, Aja Ajs 
q 130 dois ja 
=i 
EN 3 ga) \ i(jı+j2+j3) 
= - ME 3. Ap wks Sg . 
J1:92,93—0 
But 
=i : : : 
y Cristi) zm { 0 when Jı + J2 + J3 # 0 (mod p); 
p when jı + ja + j3 = 0 (mod p). 
Thus 


p—1 


1 
Nim, al zx q (q — 1)? & (q — 1) VE Au, À je Ajs 
dila Use Goals) 


Since Ay = —1, it follows from (2.9) that the last sum of products 
Aj Aj; Aj, 18 ual to 


p-1 
Nj +309 AjAp-j + S = —1— 3q(p — 1) + 9S, 
j=l 
where S = EE Ze be Aji Ae Aja. Hence 
1 
N(p,9) = la- * 1 — 3a(- 1) + 8]. 
id N 1 1 
Pd) _ 1 (@ -3gpcq-- S) - a- 3p 1 18 
gol q q 
and N(p,Q i 
=P —(q+1-3p)| = ISl. 
q— q 


By (2.10), |A;| = yq. We note also that for every jı, 1 < jı € 
p — 1, there are p — 2 pairs (j2,j3), 1 < j2,j3 € p — 1, such that 
jit ja ja = p or 2p, namely (1,p— jı — 1), (2,p—3ÿ1—2),..., (p— 
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jı—1,1), and (p—1,p—jı+1), (p—2,p—ji+2),.. ES (p—ji 1, p- 1). 
Hence |S| < (p — 1)(p — 2)g?^, so 
N(p.q) 


qd — (ac 1—3p) < (p— U(p — 2)/a 


and we conclude that 


(q — 1)[g - 1 — 3p — (p — (p — 2)q] < N(p, q) 


< (a- 1)lg * 1—3p * (p — 1)(p - 2) 4]. 
(2) For later use, we prove more generally that if v is an integer, 
v > 0, and if q > (p—1)?(p— 2)? -2(pv — 1) then g+1—(p—1)(p— 
2) /q — pv > 0. Choosing v = 3 this gives the statement (2), in view 


of (1). 


'The inequality 


o? +28 > ava? + A8 
(for real numbers a, 8), which is easily verified, implies when a = 
(p — 1)(p - 2), 8 = pv — 1: 


(p — 1)*(p — 2)?  2(pv — 1) 
> (p -1)(p - 2)y (pl (p - 2)? + 4(pv — 1). 

) 

) 


Let 6 = (p—1)?(p—2)?+4(pv—1) > 0 and consider the polynomials 


TOP ep -—(p-1)(p-2)T — (pv — 1), 


having discriminant 6. 
It suffices to show that 


> eevee 


indeed, this implies that f(,/q) = 0, that is, q--1— (p—1)(p—2) /q 
pv > 0. We have 
4g > 4[(p — 1)*(p — 2? + 2(pv — 1)] 
2[(p — 1)*(p — 2)? + 2(pv — 1) + (p — 1)(p — 2) v6] 
= (p - 1) (p - 2? +ô + 2(p — 1)(p — 2)V6 
= [p - 1)(p — 2) + V6), 
so /q > ((p — 1)(p — 2) + V6) /2, as was required to show. 


IV 
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Taking v = 3, it follows from (1) that 


N(p,q) > (a —1)[(q +1) — 3p — (p — 1)(p — 2)/4] > 0. 


The upper bounds for qo(p) given by Dickson are not sharp, as we 
see by explicit computation: 


Dickson bound Actual value 
qo(3) € 20 qo(3) = 13 
qo(5) € 172  qo(5) = 101 
qo(7) € 940  qo(7) = 491 


As a complement, Mantel showed in 1916 that if N (p,q) = 0 then 
q must be of the form q — 6mp/(p — 3) — 1 (for some integer m). 

Taking into account (2C) and the result of Chapter IV, (2B), if 
k>1, q=2kp+1 isa prime and q > (p — 1)?(p — 2)? + 6p — 2 then 
q divides the Wendt determinant W5,. Therefore, for each prime p 
there exist at most finitely many integers k > 1 such that q = 2kp+1 
is a prime not dividing W24. What is not known is whether, for every 
prime p, there exists actually one prime q with the above property 
(see Flye Sainte-Marie, 1890, and Landau, 1913). 
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X.3. Hurwitz Congruence 


In this section we give the theorem of Hurwitz concerning the con- 
gruence 


(3.1) A,X? +--+ A,X? =0 (mod q), 

where p > 3, q = 2kp + 1 are prime numbers and A,,..., A, are 
nonzero integers. Let N = N(Aj,...,As,p,q) be the number of 
nontrivial solutions of (3.1), i.e., of (z1,... , £s), with 1 < x; < q— 1, 


such that $7;., Aur = 0 (mod q). If h is a primitive root modulo 
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q, let a; = ind, (A;) (with 0 € a; € q — 2), so A; = h% (mod q), for 
V 5:625 

Thus N is equal to the number of (t4,... ,£,), 0 € t; < q—2, such 
that $7; , hetta = 0 (mod q). 

We consider the function x : Z — {0,1} defined by 


_ f 1 whenq|z 
dcr when q yz. 


Then 


(8.2) aes dee. 


di, ts=0 i=1 


Noting that if t; = t/ (mod 2k) then hin: = hrtitai (mod q); so 
(3.2) may be rewritten as 


(3.3) N=p* > X E ine) 


tise =0 i=1 
For convenience, we introduce the following “symbol”: 


2k—1 


(3.4) CEE 2 es), 


ts=0 


which is a nonnegative rational number. We may rewrite (3.3) as 
follows: 


(3.5) N = 2kp°|ai,... al = (q — 1)0" [01 6]; 


and to determine N we are led to study the symbol [a;,... , as]. 
First we note that [a4] = 0, since q /h?" 1, for every tı, 0 < tı < 
q — 2. 


LEMMA 3.1. 
| j= 1 when a, = az (mod p), 
0102/7715 0 when a, Z a (mod p). 
PROOF. q divides h” t% + hptitaz if and only if h?h *«: = — pptzte2 


= h(a-0/2*rts*22 (mod q); this is equivalent to pt, +a, = (q— 1)/24 
pt2+a2 (mod q—1). Now if a, = a3 (mod p), say ag = Mmp+a. For 
every 15, 0 € t < 2k—1, let tı, 0 < tı € 2k—1, be the unique integer 
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such that tı = k +m + tz (mod 2k). Then (q — 1)/2 + pts + a2 = 
p(k + t5 +m) +a, = pt; + a, (mod 2kp). So 


2k—1 


5 x( (hrii to + htta), 


ty ,t2=0 


EI az] = 


2k 


Conversely, if for some ty, 0 € tə < 2k — 1, there exists tı, neces- 
sarily unique, such that 0 € tı < 2k — 1 and pt; +a, = pta + a2 + kp 
(mod q — 1) then p(t; — t2 — k) = a3 —a1 (mod 2k), hence p | a2 — a1. 
So if a4 os (mod p) then x(Ah?^ **: + prit) = 0 for all ti, t2, so 
ar, ay] = 0. 


Now we collect some easy facts about [a1, ... , ds]: 
LEMMA 3.2. (1) The value of |a1,... ,a.] remains unchanged by 
any permutation of a4,... ,@s. 
(2) lo, gel = [ai,...,a!] whenever a, = a, (mod p),..., 
as =a’, (mod p). 
(3) [a1 +u,... , as +u] =[ai,..., as] for any integer u. 
PROOF. (1) This is obvious from the definition of [ai,... , as]. 


(2) Indeed, let a; = pr; + a and let t; + r; = t, (mod 2k) where 
0 <t € 2k — 1. Then hett") = bt: (mod q) and 


CONO A Es (Soe) 


EU i=1 


2k—1 
1 


2k, 2 lee dei 
1 2k—1 
ACE (Se) cri 


et 1—1l 


(3) Since bn Z 0 (mod q) then for every t; = 0,1,...,q—2: 


X (>: ium =x E Sea ; 
i=1 i=1 


hence [a4 +u,...,a; +u] = [a1,... , as]. 
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LEMMA 3.3. Jo... ,a,] is a nonnegative integer and equal to 
2k—1 
[a,...,a] = A Mim +... Aptos tert 4 pe) 
tyre th SEO 
2k—1 
= Kä (heita pesti eei po) 
H st) _,=0 
PROOF. Given t),... ,t, 4, 0 € t; € 2k — 1, for every t, Jett, 


t, , be such that 0 < t'; € 2k — 1 and t; = tı — t, (mod 2k). Noting 
that bit: Z 0 (mod q), we have 


fées gel 
1 k— k—1 s 
= OR ps X E i) 
ts—_1=0t,;=0 i=l 
1 2k—1 2k—1 


racio 5 5 x(h?* (ht tar Ai 4 ppt atest +R") 


2k, 
test, Q0 ts=0 
1 2k—1 
= —2k hetitai 4 petta A... 4 petitas- 4 pas), 
sch 2x x( + Te + he) 
t tn 40 
Hence [a;,... , as] is a nonnegative integer. For the last equality, we 


note that a, = a, + (q — 1)/2 (mod p) and hG@-1/2 = —1 (mod q). 
'Then, using Lemma 3.2 and what we have just proved, 


q-— 
lai,- tts] fas. ,As—1, A, + ——— 


2 
2k—1 
= 5 x (AP te Hess + pPtsi test _ he): 
ti... 0 
LEMMA 3.4. If r,s > 1 then 
[ag ies ; Qs, b... ; bh, 
p—1 
= 2k|a,... ; as|[b;, . .. , bh, tes. ó as, C|[bi,... brc]. 
c=0 


PROOF. Let A; =h™ (mod q), B; = h (mod q). Then 
Dh laus ses day bip... , br] =N, 
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the number of solutions (z1,... , $,,y1,... , Yr) with 1 € z;,y; € 
q—lof»;5. AX; +575; B;Y; =0 (mod q). Let N' be the number 
of solutions (z1,..., £s), with 1 < x; < q— 1 of $5; , AX? = 0 


(mod q) and let N” be the number of similar solutions of 575 ., B;Y7 
= 0 (mod q). 

For every d = 0,1,...,q9— 2 let N° be the number of solutions 
(£1,... Pal 1 < x; € q— 1, of 


5 A,X? +h’ =0 (mod q) 


i=l 


and let N7 be the number of solutions (yi,..., Yr), 1 € y; <q—1, 
of 


XC B;Y”p —h?=0 (mod q). 
j=l 


Then N’ = Ep ones. N” = EDU [Dis su bal: 
2k-1 
Ge E x (Some +) =D" dass as d] 
Et i=1 
(as follows from Lemma 3.3), 
2k-1 
TP. E A SC rech — pi =p'|a,... ; Gr; d] 
E zt j=1 


(as follows from Lemma 3.3). 
We have N = N'N” + 575 NN". But 


lai, ... , as, d] = perse as, cl], Deo , br, d] = bises , brc], 


when d = c (mod p). Therefore 


2kp**"|ai, ... ETE bi, ees br] 
ed ers ces dslibisses , brl 
p—1 
+2kp°t" V "la, sheng as, C|[bi,... , br, C 
c=0 


and dividing by 2kp**", we have the required relation. 
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LEMMA 3.5. For any integers aj, G2, c: 
p—1 
3 [a 22, 0 + d| = 2k — [a4, ao]; 


d=0 


in particular, the above sum is independent of c. 


PROOF. 
p—1 p—1 
aas cd = D 4, a2] 
d=0 


d= 
p— 2k—1 
P? | S E EE 2) 


d=0 \t,u=0 


by Lemma 3.3. Let U, be the set of all u, 0 € u < 2k — 1, such 
that Ah?"**: + bus = 0 (mod q). Then the number of elements of 
Uy is #U, = Sol x (hP*** + h°) = Jan, gel, by Lemma 3.3. Let 
Uz be the set of all u, 0 < u < 2k — 1, not belonging to U;', so 
HU = 2k — Jo, a2]. We may write 


p—1 p—12k-—1 


2 laan cd = A, 2, 3, x(W e mtm + he) 
d=0 uEU: d=0 t=0 
p-12k-1 


d 5 Kä os RTE de puta + pe 


uEU2 d=0 t=0 


If u € U, then y(hPt+dte+ pta + h22) = y(hrttate) = 0. If u € Us, 
there exists exactly one couple (d, t) such that h”ttite4 hruta: p ha = 
0 (mod q). So Solo, a2, c+ d] = HU: = 2k — [a1, a]. 


LEMMA 3.6. For any integers a1, da, a3, Q4: 


p—1 
DC + d,a2 + d, as, a4] 


d=0 


= (q — 1)[a1, a») |a; a4] + (2k — [a1, a2])(2k — [aa, a4]). 
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PROOF. By Lemma 3.4 we have 


d=0 d=0 
p—1 
ag 5 [a4 + d, a2 + d, c|[as, a4, c] 


d,c=0 


p—1 Dr 
Y [ai + d, az + d, az, a4] = 2k | LEE) [a3, a4] 


= 2kplay, a] |a, a4] 


p-1 pl 
A SEA (Stes ei 

d'=0 c=0 
(where d' = c — d (mod p), 0 € d' € p — 1), by virtue of Lemma 
3.2. The above sum is therefore equal to (q— (lo, a;]|a3, a4] + (2k — 
(ai, a3]) (k — [a3, a4]), by Lemma 3.5. 


For any integers n,m let 
p—1 
(3.6) Onm = CR m + nd, 0]. 
d=0 
We have: 


LEMMA 3.7. Qom = 2k — [m,0], aim = 2k — [m, 0], Gaam = ewm 
when n =n! (mod p), m = m (mod p) and Y, anim = q — 2. 


Tri 


PROOF. By Lemmas 3.2 and 3.5, 


p—1 p—1 
dom = Y d. m, 0] = Y [m,0, d] = 2k — [m, 0]. 
d=0 d=0 
Similarly, 
p—1 p—1 p—1 
Qim = Y dm + d, 0] = y 0, m, d" = 3 Ion, 0, d" = 2k — [m, 0] 
d=0 d'—0 d'—0 
(where d' = —d (mod p), 0 € d' € p — 1). From Lemma 3.2 it 


follows that Qn» depends only on the classes of n,m modulo p. 
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Finally, using Lemma 3.5, 


p-1 p—1 p-1 
5 Anm = 5 [d,m + nd, 0] 
m=0 m=0 d=0 
p—1 p-1 
= 3 [d,0,m + nd] 
d=0 m=0 
p—1 


(2k — [d, 0]) 


I 
[7] 


d—0 
= 2kp — 1 = q — 2. 


Next we consider the expressions (for any integers n,m) 


(3.7) 
p—1 
sim 5 An, dAn, dim = 5 [c,d + ne, 0Ï[e, d + m + ne, 0]. 
d,c,e=0 
We have: 
LEMMA 3.8. 
p—1 
Onm = 9 (a — Din — 1)j +m, O][nj +m, 0] 
j=0 


+ (2k — [(n — 1)j + m,0])(2k — [nj + m,0])} — 2k, 
and if n Z 0,1 (mod p) then 


| f 2k(q—4)--q when m=0 (mod p), 
LEE 2k(q — 4) when m £ 0 (mod p). 


PROOF. 


p-1 p—1 
Osa = | led- ne,0fe, d+ m net] 


p—1 
= | [c — nc, —nc, dlle — ne — m, =ne ZI 
c,e=0 \d=0 
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By Lemma 3.4 the above sum is equal to 
p—1 


3 (le nc,—nc,e — ne — m, =ne — m] 


c,e=0 


p—-1 /p-1 
5 (Ste - ne + ne-m, -ne-ne + mel — 2k. 


e=0 \c=0 


Let j = e — c (for any fixed e), then c — nc + ne +m = (n — 1)j + 
m 4 e, -nc + ne -- m — nj +m. Hence the sum is equal to 


EE nieneseniema) — 9k 


j-0 


p-1(p-1 
= [Eo ni em + en -2 


e=0 
and according to Lemma 3.6 the sum is equal to 


p—1 


S {Ga 1){(n - 1)j + m, O] [nj + m, 0] 


| + (2k — [((n — 1)j + m,0])(2k — [nj + m, 0])} — 2k. 
If n 40,1 (mod p) and if m = 0 (mod p) then 


o = Y (a = Ulo — 10n, 


+ (2k — [(n — 1)5, 0] (2k — [nj 0]) ; — 2k 
= (q— 1) +4pk? — 2k — 2k + 1 — 2k 
= q+ 2k(q — 1) — 6k 
q+ 2k(g — 4). 
If n Z 0,1 (mod p) and m #0 (mod p) then a similar computa- 


tion gives 


On m = 4(p — 2) + Ak(2k — 1) — 2k = 2k(q — 4). 
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LEMMA 3.9. If n # 0,1 (mod p), for every integer m we have the 
inequalities 


q—2—(p—1)q < Paum <q—2+(p—1),/g. 
PROOF. We have seen that 


Ono = 029-024 +0, 4 = 2k(g — 4) - q 


»p—1 
and if r #0 (mod p) then 

On,r = An,0An,r + An,1An,r+1 HPht ste Qn, p—1Qn,r+p—1 = 2k(q E 4). 
Then 


(0.0 Sei Anr)” F (0.1 — Ole een) St ee (Qn p-1 =] Os coppa) 


p—1 p—1 
2 E a.) — 2 E TA 
c=0 c=0 


= 2[2k(g — 4) + q] — 4k(q — 4) = 2q. 


But then 


2q > (ds = di) SR Fée Kë Anm) 


2 ( Qn, m+r + tien fe = SE 
Dia an | 
: 2 2 


2 
Qn,m+r T Qn,m—r 
2 | Onm ` à 
(e | 2 ) 


IV 


Since J/q is irrational, we have the strict inequalities 
On. mr + An,m-r 
Vq > Anm 7 > —J. 


The above inequalities hold for every r = 1,2,...,p —1. Adding 
them up, and noting that 


pi pol 
Q m+r zk: On,m-—r \ Da ant SR Dann 
L 2 = One +) 2 
r—1 r=0 
and that 
p-1 pl 
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(by Lemma 3.7), then 
(p — IVa > (p — Haan — (70s, +q - 2) > —(p — 1) V4 


hence 


q—2-—(p—1)q < Panm «q-—2-(p—1)q. 


After these lemmas, we turn to the consideration of the congruence 
(3.8) AX? + BY? + CZ? z 0 (mod q) 


(with A,B,C nonzero integers). Let A = h* (mod q), B = h? 
(mod ol, C = h° (mod q), with 0 € a,b,c < q— 2. With these 
notations, we have: 


(3A) 
p—1 
pla, b, c] = 6k +2 — q—v + 5 Qn,b—c—n(a—c)) 
n=2 
where 


v = [a,b] + [b,c] + [c, a] 
0 when a,b,c are pairwise incongruent modulo p, 
| 3 when a,b,c are congruent to each other, 
1 when two of a,b,c are congruent, but not the other one. 


PROOF. We shall compute the sum S = 577 de d—ne Where d, e are 
integers. By definition, 
—1p-1 
S= Soia — ne + nj, 0]. 
n=0 j=0 
If j =e (mod p), then for every n we have d—ne+nj =d (mod p). 
If j e (mod p) then (d — ne + nj |n 20,1,...,p— 1} is a set of 
pairwise incongruent integers modulo p. Therefore 
—1p-1 
S = ple,d,0] 4 - ^ Y 15,40] 
t=0 j=0 
p—1 


= ple, d, 0] aU 5 (2k z D 0]), 


j=0 
je (mod p) 
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by Lemma 3.5. So 


S = ple,d,0|] + 2k(p — 1) + fe, 0] — SCH 0] 


j=0 
= ple, d,0] + 2kp — 2k + [e,0] — 1 
ple, d, 0] T [e, 0] Tg. 2 — 2k. 


But ao a = 2k — [d,0] and ai a-e = 2k — [d — e,0] = 2k — |d, e], hence 


p—1 
ple, d, 0] = 5 On,d—ne + 6k — (q = 2) = {[d, 0] KR |d, e] + [e,0]}. 
n=2 
Let e = a— c, d= b— c; then pla, b, c] = pla—c, b—c, 0] = ple, d, 0| = 
Na Cao ated Bee 


After these preliminaries, we may prove the theorem of Hurwitz 
(1909): 


(3B) The number N of solutions (x,y,z), with 1 <a,y,z<q-1 
of the congruence (3.8) satisfies the inequalities: 
(a—1)[(a+1) - (p- D(p-2)/q — pv] < N 
< (q — 1)[(q +1) + (p — 1)(p — 2) — pv]. 


PRoor. By Lemma 3.9 we have 
(»-2)a-2-(p-1) d] « p E au-e neo < (P-2)1g-2- (5-1) V], 
hence by (3A) we have 
p(6k -2—q—v) - (p-2)|(a - 2) - (p - yq 
< p'[a,b, c] < p(6k +2 — q- v)  (p- 2)(a — 2) + (p — 2) q], 
that is, 
q--i-vp-(p-2)(p—1)yq < Globe 
< qt 1—-vp-(p-2)(p- 1). 
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But by (3.5), N = (q — 1)p?[a, b, c], hence 


| 
(a — Dila +1) — (p—1)(p — 2) — pv] 
< N <(g—1){(a+ 1) - (p— 1)(p — 2)/q — pv]. 


To guarantee that the congruence (3.8) has a solution, it suffices 
to show that q +1 — (p — 1)(p — 2),/q — pv > 0. 


(3C) Ifq>(p—1)?(p— 2)? +2(prv —1) then N > 0. 


PROOF. It was shown in (2) of (2C) that if q > (p— 1)?(p — 2)? + 
2(pv — 1) then q > (p — 1)(p — 2) /q + pv — 1. It follows from (3B) 
that N » 0. 
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X.4. Fermat's Congruence Modulo a Prime-Power 


We shall consider in this section the congruence 
AT - Y?" 4 Z°” (mod p^), 


where p is an odd prime number and n > m > 1. According to (1B) 
we may restrict our attention to the congruence 


(4.1) X?" Y?" EZ?" =0 (mod p™*?), 


We have already begun the study of this congruence in Chapter 
VI, $1. We recall (Chapter VI, (1H)) that there exist integers +, y, z 
not multiples of p, satisfying the congruence (4.1) if and only if there 
exists a, 1 < a < (p — 3)/2such that 1+a?" = (1--a)" (mod p"). 

More generally, let k > 3, m > 1 and p be an odd prime. We wish 
to study the congruence 


(4.2) X?” + X?” dps XP” = 0 (mod p. 
A (nontrivial) solution is a k-tuple (£1, £2,... L2 of integers, 1 < 
zi € p" —l,plz(foralli—-1,...,k), 2? Laf 4.422 =0 
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(mod p™*!). Two solutions (£1, £2,... , £p) and (yi, ya, ... , yx) are 
said to be equivalent if there exists some integer a, not a multiple 
of p, 1 € a € p"*! — 1, and a permutation x of (1,2,... , k], such 


that y; = azz) (mod p"*!) for i = 1,...,k. This is clearly an 
equivalence relation in the set of solutions. 

For each integer a let à = a (mod p"*!) and let (Z/p"*!)* be the 
multiplicative group of invertible residue classes modulo p"'*!. Let 


U = Um" 
= {b| there exists a, prime to p and such that b = a?” } 


and let 
V=V(p™t') = {b|b=1 (mod p)}. 


As is well known, U,V are subgroups of (Z/p"*!), U has p — 1 
elements, V has p™ elements, and 


(4.3) (Zip &UxV 


(see any standard text on elementary number theory, or Ribenboim's 
book on algebraic numbers, 1999). 

Let AU = {9} m; esed fori=1,...,h} for every h > 1. 
Then the congruence (4.2) has a solution if and only if 0 € kU. 

For example, let m = 1. If p = 3 then U = {1,8}, and it is easy 
to verify that 0 ¢ 3U, 5U, TU, but 0 € AU for all h Z 1,3,5,7. 

We deduce that a cube cannot be a sum of two, four, or six cubes, 
if these numbers are not divisible by 3. Otherwise, if, for example, 
y? = V r3 then (-y)?-- 325 , 23 = 0, hence also —y? +56 z — 0 
(with =y, z;, belonging to U), which is a contradiction. 

Similarly, if p = 5 then U = {1,7,18,24}. By computation we 
verify that 0 ¢ U,3U,5U, but 0 € AU for all h Æ 1,3,5. Thus, a fifth 
power cannot be a sum of two or four fifth powers, if these numbers 
are not divisible by 5. 

Let g be a primitive root modulo p, 1 < g < p, and let r = g?” 
(mod p™*"), 1 < r < pt, Then 1,7r,r?,r?,... ,r? ? are pairwise 
incongruent modulo p"*!, and U = (1,7,7?,r?,... , r??). In other 
words, given g, every element 7” € U is uniquely equal to some 
power 7, with 0 < i < p—2. So every solution of (4.2) corre- 
sponds bijectively to a representation of 0 as a sum of powers of r in 
(Z/p™*")*, that is, to a congruence 


p pq p... pie =O (mod pte. 
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with 0 € à, € p — 2 (fort — 1,... ,k). 

Two such representations (r**,... , r*+) and (r?*,... , r) of are 
said to be equivalent when the corresponding solutions of (4.2) are 
equivalent. Explicitly, there is a permutation a of {1,2,...,k} and 
an integer h, 0 < h < p — 2 such that i, = jra) + h (mod p — 1) for 
Ul 

A representation (r*!,... ,r'*) of 0 is normalized when i, = 0 < 
dg Sets < ip < p—2. Every representation is equivalent to a 
normalized representation, as easily seen. However, as we shall see, 
an equivalence class of representations of 0 may contain more than 
one normalized representation of 0. 

A cyclic solution of (4.2) is a solution (z1,25,... ,æ,) where zı = 1 
(mod p™t'), z; = af! (mod p"*!) for j = 2,... , k for some integer 
a, not multiple of p. Each cyclic solution corresponds to a cyclic 
representation of 0, which is a representation (relative to a given 
primitive root modulo p) of the form 


(4.4) l-Er +r” cede DE = 0 (mod pt!) 


(for some i, 0 € i € p — 2). 


(4A) If p = 1 (mod k) then there is a cyclic representation, 
namely taking à = (p — 1)/k. 


PROOF. 


(1t+rit---tr@D) 1 —r) =1-—r" =1-r?-1=0 (mod p"). 


But r = op (mod p™*!), so r = g (mod p), hence r is a prim- 
itive root modulo p. If rê = 1 (mod p) then p — 1 divides i = 
(p — 1)/k so k = 1, contrary to the hypothesis. This shows that 
(1 — r*) (mod p"*!) is invertible, hence 1 + rê +... + r@-Dt = 0 
(mod p'**!). 


In the particular case where k — 3 and m — 1 we have the cyclic 
representation 1 4- r' +r” = 0 (mod p°) where i = (p—1)/3, r = of 
(mod p?). So rê = (gP-1/3)P (mod p?) and (if = gP? = 1 
(mod p°), thus rê is a cubic root of 1 mod p?. 

The following criterion for Fermat's theorem relies on the existence 
of a representation which is not cyclic (see Klésgen, 1970): 
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(AB) fm 2 1, p is an odd prime and if there exist integers x, y, z, 
such that p Y zyz and zx" + y?” + zP^ = 0, then 0 has a noncyclic 
representation modulo mim"). 


PRoor. We have 
QU yr yq), 
hence by the result of Pollaczek, quoted in Chapter VI, (2S), we have 


+ (ay) +y" £0 (mod p) 


m— 


ae 
and therefore 
x? +ary+y? #0 (mod p). 

Let w be an integer such that wx = y (mod p). So 


l4 w--w!z0 (mod p). 


From x?” + y?" + 22" = 0 we deduce that a?" yr" +22 =0 
(mod p*"*1) (see Chapter VI, (1M)). We have also x + y +z = 0 
(mod p). So z = —(a+y) = —x(1+w) (mod p) and therefore 


3 


PT stung" 


(mod peti). 


We deduce from the hypothesis that 
az?" (1+ w” —(1+w}")=0 (mod p?"*!), 
so 
1+w?” —(1+w)?” 20 (mod p?! 
If —(1 + w)?" = w?*"" (mod p?"*!) then —(1 + w) = w? (mod p), 
hence 1+ w + w? = 0 (mod p), which is a contradiction. 
If wë" = (1+ w)?” (mod p?"*!) then w = (1+ w)? (mod p), 


so 1 + w +w? = 0 (mod p), again a contradiction. Thus we have 
obtained a noncyclic representation of 0 modulo mim", 


Thus, for example, if p= 1 (mod 3), if the only representation of 
0 modulo p* is the cyclic representation 


l+rt¢r% = (mod pf), 


where g is a primitive root modulo p, and r = "a (mod p*), then 
the first case of Fermat’s theorem holds for the exponent p. 
Following Klósgen, and keeping the preceding notation, we show: 
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(4C) (1) (linn (mod p"*!) withl<i<j<p-2, 
then the normalized representations equivalent to the above one are: 

(Ri) 1 r! 4 r? =0 (mod p™*"); 

(Ro) 14- rj + r?717* =0 (mod p™*"); and 

(R3) 14 r? 7173 + p97173** = 0 (mod p"*?). 
If j = 2i then the representations (R4), (R3), (H3) coincide. If j z 
21, the representations are distinct. 

(2) 1f1+1+7/72=0 (mod p™*"), then 1 € j and the normalized 
representations equivalent to this one are: 

(Ri) 14-14 rj =0 (mod p”*1); and 

(R2) 1 4 r?7177 + p? 717 = 0 (mod min. 
In this case the representations (R|), (R5) are distinct and 2?" = 2 
(mod p™*?), 


PROOF. (1) From (R;) we obtain (R2) by multiplying with ii: 


rPI p rP pp TS sep (mod p"**!), 


so 1 4- rJ +r? = 0 (mod p™*"), and this is a normalized repre- 
sentation (because 1 < j — i < p—1—i< p — 2) which is equivalent 
to (R4). 

In the same way, we see that (R3) is equivalent to (R5). If r^ + 
r^t! 4+ ph*3 = (mod p™*") is an equivalent representation which is 
normalized then one of the three cases must happen: 

(a) h =0 (mod p — 1), which yields (R,); 
(b) h+i=0 (mod p — 1), which is equivalent to (R2); and 
(c) h+j 2 0 (mod p — 1), which is equivalent to (Rs). 

If j = 2i then 1 + r +r” = 0(mod p™'), 1 cr r1 = 0 
(mod p"*!) and 1 + r?717? 4 p?-17? = 0 (mod p"*!). So r” = 
r? 1^ (mod p"*!). Therefore 2i = p — 1 — i (mod p — 1). But 
1 < 2i, p—1—i < p—2 hence 2i = p— 1 — i, thus i = (p— 1)/3. We 
conclude that the representations (R1), (R5), (R3) coincide with the 
cyclic representation 1 + r(?7)/3 + p2@-Y/3 = Q (mod p™*?), 

It remains to see that in all other cases these representations are 
distinct. 

If (R,) and (R2) coincide then i = j — i, so j = 2i, contrary to the 
hypothesis. Similarly, if (R,) and (R3) coincide then i = p—1—J, j = 
p—1—j+iand again 27 = p— 1 +i, 27 = 2(p — 1) — 2i, hence 
i = (p — 1)/3 and j = 2(p — 1)/3 = 2i, contrary to the hypothesis. 
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Finally, if (R2) and (F3) coincide, then j— i = p-1—j, p-1—i-— 
p—1-—j-4 i and once more 2i = j, contrary to the hypothesis. 

(2) If1+1+7? = 0 (mod p"*!) then we have the normal- 
ized equivalent representation (R5) (obtained by multiplication with 
r?-1-3). We have also j Z 0, since pt! > 3. So (Rj) is not the 
same representation as (R1). In this case 24-r/ = 0 (mod p"*?). Re- 
calling that r = of" (mod p"*!) and noting that o = g (mod p), 
then 2 + g? = 0 (mod p), so g’ = —2 (mod p) since gi?” = —2?” 
(mod p"*!), and therefore 2^" = 2 (mod p™*?). 


We may rephrase this result as follows: 


(4D) The equivalence classes of solutions of 
1+ XP +X?” =0 (mod p™+!) 


consist of six distinct solutions, with the following exceptions: 

(a) p = 1 (mod 6), a Z 1 (mod p?) but oi = 1 (mod p°): in 
this case (1,a,a?) and (1,a?,a) form an equivalence class of 
solutions. 

(b) 27" = 2 (mod p™*"); in this case (1,1, 2), (1, C2, 1) and 
(T, (—2)P" 0-2, (-2)»" 0-2) form an equivalence class of so- 


PROOF. According to (4C), if p = 1 (mod 6) the given congruence 
admits the cyclic solution; its equivalence class contains exactly two 
solutions (of which one is normalized). If p Z 1 (mod 6), there is no 
cyclic solution. If 1 4- 1-- 77 =0 (mod p"'*!) then there are precisely 
three solutions in this equivalence class (of which two are normalized 
and if this happens then 2?" = 2 (mod p"*!). In all other cases, 
each equivalence class contains exactly six distinct solutions. 


We show now that in certain cases, it is possible to obtain a new 
solution of (4.1) from a given one (Peschl, 1965): 


(4E) Ifl+r’+ri = 0 (mod p™"), where j = 3i + (p — 1)/2 
(mod p — 1), then 1 + r + r(?-0/2*5: = 0 (mod p"**!) and this 
representation is not equivalent to the given one. 
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PROOF. Since r(?-2/7 = —1 (mod p"*!) then 1 + ri — r$ = 0 
(mod p"*!). Hence —r — r + e 0 (mod p"*!) and rä + 
vi — pr? = 0 (mod p"*!). Adding these congruences, we obtain 


1 +r“ — r = 0 (mod p"), that is, 1 + rf + r(»-0/2*5: = 0 
(mod p'**!). 

If this normalized representation is equivalent to the given one, 
by (4C) we must have one of the following three cases: 

(a) 4i = i (mod p — 1) and 5i = 3i (mod p — 1), hence i = 0 
(mod p — 1), so i = 0 and 1 +1 — 1 = 0 (mod p"*!), a 
contradiction. 

(b) di = 24 -- (p—1)/2 (mod p— 1) and (p—1)/2+5i = p—-1—i 
(mod p — 1) and this leads, as before, to a contradiction. 

(c) 4i = (p — 1)/2 — 3i (mod p — 1) and (p — 1)/2 + 5i = (p 
1)/2 — 2i (mod p — 1), leading again to a contradiction. 


In a similar way, we have: 


(4F) Ifp=1 (mod 4) and 
124 r0-9/ Gud 20 (mod pm 


then 
14+itr%+e-1/4= 9 (mod p"*l), 


and this representation is not equivalent to the given one. 


PROOF. 
LltreD/44 ri 20 (mod peo 
hence 
elle 4 1 Se pi +(p-1)/4 = ( (mod ger?) 
and 
tpIt@-D/4 _ pi 4 725+(-1)/4 = 0 (mod pus 


Adding these congruences, we have 

L+1tr%t+@-D/4 20 (mod p"). 
We note that r®-)/4 Æ +2 (mod p™+!), otherwise —1 = r(?-)/2 = 4 
(mod p™*'), which is impossible. So j # 0 (mod p — 1) and j Æ 


(p—1)/4 (mod p— 1), so the representation obtained is not equivalent 
to the given one. 
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We shall now concentrate more on the study of the number of 
solutions of (4.2). Our method will be similar to the one in §2, in 
connection with Dickson’s theorem. 

We introduce the following notations, where p > 2 is any prime 
number, m > 0, k > 3: Let F(p, m, k) be the number of (21,... , £p), 
such that 1 € x; < p — 1 (for à = 1,... , k) and 


af As ex? =0 (mod p"H). 


Let a be any integer, 1 € a € p — 1 and let F(p,m,k;a) be the 
number of (z;,... , 24) such that 1 € x; < p—1 (for i = 1,...,k) 
and 

zf + ab +--+. +a?" map? (mod p"). 
Let N(p, m, k) be the number of (x2,... , £p) such that 1 € x; < p-1 
(for i = 2,... , k) and 


1+ +---+a?" =0 (mod p™*?). 


If k = 3,m = 1, we shall simply write F(p) = F(p,1,3), F(p;a) = 
F(p,1,3;a), N(p) = N(p.1,3). 

First we indicate some relations between these various numbers. 
Then we shall derive inductive formulas in terms of certain periods 
of the cyclotomic field and, in turn, in terms of Jacobi cyclotomic 
sums. 

For m = 0 it is easy to compute explicitly: 


(4G) 


In particular, 


F(p,0,3) = (p — 1)(p — 2), 
N(p,0,3) = p — 2. 


PROOF. If 1 <a; < p-1fori—l,...,k—2andz,-4---- 4.2 #0 
(mod p) then we may choose p — 2 values for z, 1, such that 1 < 
mu € p—1l, mut, kp 24-141 É 0 (mod p), and this determines a 
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unique £p, 1 € £k € p—1, such that pt: xy 4--z, = 0 (mod p). 
So we have already [(p — 1)*~? — F(p,0, k — 2)](p — 2) solutions. 
Now if 1 < xz; < p— 1 for i = 1,... ,k — 2 and zı +:--+£k-2 = 0 
(mod p) then we may choose p — 1 values of zu (which determine 
£p) and hence we have F(p,0, k —2)(p — 1) solutions. Thus for k > 3: 
F(p,0, k) 
= [(p— D? F(p,0, k — 2)|(p — 2) + F(p, 0, k — 2)(p — 1) 
= (p—1)**(p — 2) + F(p, 0, k — 2). 
In particular, F(p,0,3) = (p — 1)(p - 2), F(p,0,4) = (p — 1)*(p — 
2) + (p — 1). From the above relations we obtain 
F(p,0, 2k RE 1) = (p ~~ (elt? SCH 2) SR F(p,0, 2k E 1), 
F(p,0, 2k ES 1) m (p E Des > 2) zs F(p,0, 2k =~ 3), 


Hence, adding up, 


F(p,0,2k +1) = (p—1)(p — 2)! ae — 1) 
= (p—1) S (p 1)25+1 E E (p— 1)2 
-= (p—1)?* -1 (p—1y*—1 
= (p "G-Dc1 = (p—1) S 
Similarly, 
F(p,0,2k) = (p- 2) (p - 1 +1 
EN (p— 1) — — (p- 9 41 
= (p—1) Y 1)? = (p— 1)! 
= (e 1 E (p- € puc 
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Hence, whether & be even or odd, we have 
(p- 17 - (1) 
5 ; 


F(p,0,k) = (p — 1) 
'The same argument gives 


N (p, 0, k) rw (p — ere 2) T N(p,0, k WE 2). 


Thus 
N(p, 0, 2k + 1) = (p — 1)*-?(p — 2) + N(p, 0, 2k — 1) 
and 
53 1 2k 
—3y*-—1 
N(p,0,2k 4- 1) B a 
P p 


Similarly, N(p,0,2k) = ((p — 1) #71 + 1)/p, so for any values of k: 
ped + (-1)* 
S 


N(p,0,k) = 


More generally: 


(4H) With the above notations: 
(2) F(p,m,k;1) = F(p,m,k;2) = --- = F(p,m,k;p — 1). This 
number shall be denoted by F*(p, m, k). 
(3) N(p,m,k) = N(p,m —1,k) — F*(p,m,k) for m > 1. In 
particular: N(p,1, k) = ((p — 1)! + (—1)*)/p — F*(p, 1, k). 


PROOF. (1) We consider the sets F = ((zxj,...,z,) | 1 < x; < 
p—1fori — L...,k and a? +--+?" = 0 (mod p™")} and 
N = {(£2,... £k) |1 <a; <p—1fori=2,...,kand1+a2 + 
pa? =0 (mod p™*)}. If y is any integer, 1 < y € p — 1 and 
(z2,... ,2x) EN, if yi = y, yi = yx; (mod p) for i = 2,... , k, then 
(Y1, Ya, --- a) € F. Different values of y yield EE Solutions of 
X?” + À - X" = 0 (mod p"*!). 

If (£2,..., M c N and (25,...,2,) € M, with (a,...,2%) F 
(z5,... , 25), if y, y are integers such that 1 € y, y' € p— 1, the above 
method leads to distinct solutions (yi, gs,... , ux) Æ (Ui us Ye) 
because if y Z y’ then yı Z yj, and if y = y’ and say zı Z xj, then 
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It is also clear that every (yi, yo, ... , yx) € F may be obtained in 
this manner, namely taking z;, 1 < x; € p — 1, such that y; = um: 
(mod p) for i= 2,...,k. Thus, F(p,m,k) = (p — 1)N(p, m, k). 

(2) Let 1 < a,b < p — 1, and let c be such that b = ca 
(mod p), 1 <c € p— 1. If (£1, z2,... , 24) is such that 1 < x; < p—1 
and 

af +2 tipak =ap™ (mod p"), 
then letting y; be such that 1 € y; € p — 1, y; = cr; (mod p) then 


y" =P" af (mod p"*!). From c?" = c (mod p) we conclude that 


V tub te tuk zs asf" (mod pt). 
In this way we establish a bijection between the sets of solutions of 
xi sf x?” fe ecaa x?” = ap" (mod p"*!) 
and of 
X?” + X?” + = bp™ (mod pe: 


Therefore F(p, m, k;a) = F(p, m, k; b). 

(3) Let ap. : Rt AE = 0 (mod p") with 1 € x; € p—1 (for 
i=1,...,k). Then 2?” +--+?" =0 (mod p”) since zf = z?” 
(mod p"). Hence there exists a, 0 < a € p — 1, such that 

zf pea? map" (mod p™*?), 
Thus (2,,... , z&) is a solution of 
AG +--+ X2” =0 (mod pt) 
or a solution of 
X?” +--+ XP =ap (mod p™ 


for some a, 1 < a € p — 1; and conversely. 
Hence, by Part (2), 


F(p,m —1,k) = F(p, m, k) + (p — 1)F* (p,m, k). 
By Part (1), 
N(p,m —1,k) = N(p, m, k) + F* (p,m, k). 
In the special case where m = 1, by (4F) we have 
(petzen 
p 


N(p,1, k) = F*(p,1, k). 
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If p > 3, let 
0 when p = —1 (mod 6), 
1 when p=1 (mod 6), 


and " 
when 2?" = 2 (mod p"*!), 


1 
qim) = { 0 when 2?” 42 (mod p"). 


Then we have 


(41) 
^(p, m) + 3y(p,m — 1) (mod 6), 
(p,m) + 26(p) (mod 6). 


F*(p,m,3) = 


3 
N(p,m,3) = 3 


PROOF. The solutions of 1+ X?" + X?” =0 (mod p™*!) are orga- 
nized into equivalence classes. By (4D), these classes consist of six 
elements, except when p = 1 (mod 6), where there is a class of only 
two elements, and when 2?" = 2 (mod p™*"), where there is a class 
of three elements. Thus 


N(p,m,3) = 3y(p,m) + 26(p) (mod 6). 
By (4H), 
F*(p,m,3) = N(p,m — 1,3) — N(p,m, 3) 
= 3q(p, m — 1) — 3y(p, m) 
= 3(p,m — 1) +3y(p,m) (mod 6). 


For the special case where m — 1, k — 3, we have 
N(p) = 3y(p, 1) + 26(p) (mod 6). 


In order to indicate an upper bound for N(p) (with p > 3) we 
need to study in more detail the Cauchy polynomials modulo p. We 
recall from Chapter VII, 82, that 


(X +1)P — XP —1=pX(X +1)(X° + X - 1YC,(X), 
where C,(X) € Z[X], 


_ f 1 when p = —1 (mod 6), 
EU when p = 1 (mod 6). 
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X? + X +1 does not divide C,(X), C,(X) is a symmetric monic 

polynomial, C,(—1 — X) = C,(X), hence C,(0) = C,(—1) = 1. Let 

(X +1) — EE 
p 

and q(X) = q(X) (mod p), C,(X) = C,(X) (mod p). 


q(X) — € ZX] 


(4J) For p >3, we have: 
(1) All the roots of G(X) (different from 0 and —1) in F,, and 
all roots of C (X) in F, are double roots. 
(2) Ifa is a root of C,(X) then each element in the set 
a 1+ gd 
lt+a’  1+a o 


1 
M, = le ui + a), 
Q 


is also a root of C,(X). If M, has less than six distinct el- 
ements, then Ma = (1, —2, (p — 1)/2 (mod p)} (in this case 
2? = 2 (mod p°)) or p = —1 (mod 6) and o? +a 4-1 = 0, 
aœ ¢ Fp. 

(3) X? +X c 1€ F,[X] does not divide C (X). 


PROOF. (1) Let o € F, be such that g(a) = 0. 
(X +1) — X?^1. If a Z 0, —1 then (a +1)? 
q (a) = 0. Since g"(X) = (p — 1)[(X + 1)? — X? i then 


Ta = -+D ae À = D 


This shows that a 4 0, —1 is a double root of q( X). 

From q(X) = X(X + 1)(X? + X + 1)°C,(X) if C,(o) = 0 then 
g(a) = 0, so taking derivatives: 0 = (a? + a + 1)C (a). We have 
o? +a+1 4 0, otherwise since a € F, then necessarily p = 1 (mod 6) 
(Chapter I, Lemma 4.1). Hence € = 2 and C,(a) = 0, since ais a 
double root of g(X). Thus C, (a) = 0, and a is necessarily a double 
root of C,( X). 

(2) Since C, (X) is a symmetric polynomial such that C,(—1 — 
X) = C,(X) and C,(0) = C,(—1) = 1, then if C,(a) = 0 then each 
B € M, is also a root of C,(X). We suppose that M, has less than 
six elements. Then one of the following cases happens: 


(i) a = 1/a: then a = +1. 
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a = —(1+ a): then a = (p — 1)/2 mod p. 
a = —1/(1 +a): then a?+a+1=0. 
iv) a = —a/(1+ a): then a = 0 or a= -2. 
a=—(1+a)/a: then a?+a+1=0. 


M, = (1, -2, (p — 1)/2 mod p) and from ((1+ 1)? — 1° — 1)/p = 0 
(mod p) we have 2? = 2 (mod p°). If o? +a +I — 0 with o € F, 
then p = 1 (mod 6) so € = 2. Since a is a double root of q( X) it 
cannot be a root of C,(X). Thus a ¢ F, and p= —1 (mod 6). 

(3) If p = 1 (mod 6) then e = 2, X? + X +1 has root o € F, 
which is a double root of g(X), hence not a root of C,(X). Hence 
X? + X +1 does not divide C,( X). 

Let p = —1 (mod 6), so € = 1. The roots of C,(X) appear in 
groups of six distinct roots, with the following exceptions: 

(i) the group of three double roots (1, —2, (p — 1)/2 mod p}; and 
(ii) the two roots of X? + X + 1 (which are necessarily outside 
F,). 

If C,(X) = (X? + X - 1! H(X) with r > 1 and X? + X +I not 
dividing H(X) € F,[X], then 


deg C,(.X) = 2r + deg h(X) 22r (mod 6), 
in view of the grouping of roots of H(X). But if p = 6n — 1 then 
deg C,(X) = 6(n — 1), so 2r = 0 (mod 6), hence r = 0 (mod 3) and 
therefore r > 3. Thus if o? +a +I = 0 then a is a triple root of 
C, (X), hence also of q( X). Therefore 
TX) = (X +T- xe 


and 
TX) = (p - DX +TP - x=] 
vanish at a: 


(a +IP? = at, (a +I)? =a??. 


Comparing, we have 


a?-! = (a + I) (a + 1)? ? = (a + la? ? = of +0", 


hence a?~? = 0, so a = 0, a contradiction. 
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We may now determine an upper bound for N(p): 


(4K) 
(1) If 2? Z 2 (mod p?) then 
Gd when p = 1 (mod 12), 
b when p = 5 (mod 12), 
Nip) S Es 
um when p =7 (mod 12), 
— 11 
3 7 when p = 11 (mod 12). 
(2) If 2? =2 (mod p?) then 
—3 
— when p = 1 (mod 12), 
— 11 
= 7 when p = 5 (mod 12), 
Mini zx y 79 
when p =7 (mod 12), 
— 5 
> when p = 11 (mod 12). 


PROOF. We recall that the nontrivial solutions of 1 + X? + X? = 
(mod p?) correspond to the nontrivial solutions of 1+ X? = (1+ X)? 
(mod p?), that is, to the zeros in F, distinct from 0, —1 of 


(X +1)? — XP —1 
p 
(1) Ifp=12n+1 then q(X) = X(X +1)(X? + X 4+ 1)°C,(X), 
deg q(X) = p — 1, deg C,(X) = p — 7 = 12n — 6. Since every root of 
C,(.X) in F, is a double root and the roots appear in groups of six, 
then 


aX) = 


(mod p) € F,[X]. 


6n—1 
6 


LI 4= 


N(p) < 2+6| |=2+6n 6 — 6n—4 


If p = 12n +5 then q(X) = X(X +1)(X? + X +1)C,(X). The 
same argument gives (noting that X? + X + 1 has no root in F,): 
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degC,(X) = 12n, N(p) € 6n = (p—5)/2. If p = 12n +7 and 
p = 12n + 11, we proceed similarly. 

(2) If2? 2 2 (mod p?) then g(X) = X(X +I)(X?+X +1) (X — 
1)?(X + 2)?(X — (p — 1)/2 mod p)? A(X) where the roots of A(X) € 
F,[X] appear in groups of six distinct double roots. 

If p = 12n + 1 then gcd A(X) = 12(n — 1) and N(p) < 2-34 
6(n — 1) = 6n — 1 = (p — 3)/2. 

In the same way we derive the other upper bounds. 


All the preceding considerations do not yet provide any explicit 
formula for the numbers of solutions of the congruences in question. 
As we shall now see, such formulas may be obtained using Gaussian 
periods and Jacobi cyclotomic sums, as in 82. 

Let p be an odd prime, m > 1, let h be a primitive root modulo 

p"*!, let ¢ be a primitive root of 1 of order p", and let p be a 
Sante root of 1 of order p™*!, p? = (. The Gaussian periods 
M = (p, m, h) are defined as follows: 


(4.5) 
Do = pt ph ra tel, 
r 8 p ER p + pn norte pr” 2)p™ + , 
Npm-1 = pu + oe ES p PER DO DRE, 


It is also convenient to agree that n; is defined for any index j, by 
letting n; = m when j = à (mod p"), 0 € i € p" — 1. 

If g is another primitive root modulo p"'*!, then g=h" (mod p™*') 
where gcd(r,p™(p — 1)) = 1. If m’ = n;(p,m,g) then ml = n (for 
i —0,1,... , p" — 1). Indeed, 


p-2 E p—2 p-2 DET 
m gi? Ti = p htr” tri SS 
KN = p = "ris 
j=0 a t=0 


because if jr = t (mod p — 1), 0€ t € p—2 then {(jrp™ + ri) 
mod p™(p —1)| 7 =0,1,...,p—2} = {(tp™ + ri) mod p™(p— 1) | 
t —0,1,...,p — 2] as we may easily verify: j Z j^ (mod p”) if and 
only if jr Z j'r (mod p"). Hence up to a change of numbering, the 
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Gaussian periods are independent of the choice of the primitive root 
modulo p™*?. 

Let us observe now that each period 7; is a real number. Indeed, 
n; is the sum of (p — 1)/2 pairs of complex-conjugate numbers: 


hip" ti 4 pi(p-1)/2+35)p™ +i nip" +i pipi nip" i 


=p +p =p Tat ER. 


And we also have the gross estimations 


p—2 
jp ™ +i 
(4.6) Ind s 35 |o" "| «p-1 
j=0 


(the equality would hold only if the numbers per would be all 
multiples of one of them, which is not the case). 

For every t not a multiple of p, let indz(t) = s, where 0 € s < 
(p — 1)p* — 1 and t= hë (mod p™*’). 

The Jacobi sums 7; = 7;(p,m,h) (for j = 0,1,... , p" — 1) are 
defined by 


pti 1 
(43) = (Op) E gm. 
In particular, 
(4.8) To = 0. 
Indeed, 
pti 
To = (1, p) = 5 o. 
2p 


so To is the sum of the primitive (p”*')th roots of 1. Hence 7 is 
the coefficient of the term of degree w(p"*1) — 1 of the cyclotomic 
polynomial 


X? = 1 = win-Un + X(o-2)" + mo + XP" + 1, 


thus 79 = 0. 
We shall see that 7; Z 0 if and only if p Ÿ j. For this purpose we 
require the following lemma about sums of roots of unity: 
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LEMMA 4.1. (1) Let n > 1 and let € be a primitive nth root of 1. 
For every integer a: 


2 n when nm |a, 
£5 LO whenn fa. 


et p be an odd prime, m > 0, and let p be a primitive roo 
2) Let p b dd pri > 0 d let p b imiti t 
of 1 of order p™*!. For every integer a: 


pt q(p"*7) when p"** | a, 
3 sn PT when p" | a, om" Ya, 
FTE 0 when p" Ja. 


PROOF. (1) Let d= gcd(n,a), n = dn’, a = da’, so gcd(n’, a!) = 1. 
Since £? is a primitive root of 1 of order n then €™ is also a primitive 
root of 1 of order n'. Hence 


SE SEN Ce, TS 


But each x, 1 € x € n, may be written as x = hn +y, 1 <y <w, 
0<h<d-—j. So 


æ=1 BEA 
d-1 n 
bn +y 
d 
QS 
h=0 y=1 
d—1 n’ 
d 
DDI 
h=0 y=1 
|. f wd=n when n’ = 1, i.e., n |a, 
" 0 when n' > i: i.e., n fa. 


(2) If p"*! |a then the sum 


is obviously equal to $(p"*!). If p"*! Y a, p" | a, let a = bp”, 
with pb. Then ¢ = o^" is a primitive pth root of 1. Every x, 
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1<a<p™'! is of the form x =hp+y,0<h<p”™-1,l<y<p; 
moreover, p yx when y Z p. So 


If p” /a then 


S+ 5 je y pes 


ple æ=1 
1<æ<pmtl 


by (1), since p"! Ja. 
Next we note that o? = C is a primitive root of 1 of order p". Each 
x, multiple of p, 1 < x < p™+!, is written as x = py, 1 < y € p". So 


D p 
5 pe = 5 cw —0 
æ=1 y=1 


by (1), since p" Ja. We conclude that S = 0. 


The following lemmas concern indices: 


LEMMA 4.2. Let p be an odd prime, m > 0, h a primitive root modulo 
p™*! and i an integer, 1 <i<p—1. Then 


ind,(1 + ip") = oni ` (mod p") 
for some integer a, not a multiple of p. 
PROOF. Let s = ind; (1 + ip"), so h? =1+ip™ (mod p"). Then 
h? E. STE ipt =1 (mod grt), 

Hence sp = 0 (mod p™(p — 1)), that is, sp = —bp"(p — 1) = bp" — 
bp"*!, then s = bp"-! — bp", so s = bp (mod p"). Since 
p | à there exists a such that b = ia (mod p), hence s = Zon! 
(mod p"). 

It remains to show that p Ja. Otherwise s = 0 (mod p"*), that is, 
1+ ip" = RP" (mod p™*") (for some integer c). By (3.3), (14- ip") 


mod p"+1 = h?""* mod p"*! e UNV = {1}, and therefore i = 0 
(mod p), which is a contradiction. 


m-—1 
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For every j = 0,1,... ,p™(p — 1) — 1 let U; = (a € (Z/p™*")° | 
ind, (@) = j (mod p™)}. Since U = (a"" | a € (Z/p*!)*) then U; is 
a coset of (Z/p™*')* modulo U. In view of a later result concerning 
sums of squares of periods, we need the following description of these 
cosets. 

Let 1 € k € m, let i=0,1,... , p"^* — 1, and let 


CM RUN ERI UNT aS 
Similarly, let 
S; = {(1 + ip) -U,[(1 + à + p*-*)p] - U, [1 + (i + 2p7*)p] - U, 
ss [L+ (i+ (p! — 1)p™-*)p] - U}. 


First we note that for each i, the cosets U; iss, Ui ppm (with 
l Æ l') are distinct. Otherwise, à + Ip" = i + lp” (mod p"), 
hence l = ( (mod p^), contrary to the hypothesis. So #6; = p. 

If i Æ i' then S; N S; = 0: if à + Ip"-* = i! + lp (mod p") 
then necessarily i = i’ (mod p"-^), so i = il. 


So LT S; has p™ cosets, that is, it consists of all the cosets 
modulo U. Similarly, the cosets 


[1+ (+ lp™*)p] - U, [+ G+ Up™-*)p] -U 
(with l Z l’) are distinct. Otherwise 


1+(itlp™-*)p- (u +(i+ Kn EeVnU = {i}, 
so 
1- (i4 Ip"-*)p & 1+ (i -l'p^-*)p (mod p™*?), 
hence l = (U (mod p^). So #S! = p". 
If 4X then S; S’ — 0 : if 
[1 + (i + lpmn-*)p] - U = [1 + (à + l'pn-5)p| - U 
then as before 


1+ (i Ip" *)p z (1 +i +lp™™*)p (mod p"*!), 


hence necessarily i = i’ (mod ml, so i = i'. Thus te + S; has 
p” cosets, that is, it consists of all cosets modulo U. 
Now we prove: 


LEMMA 4.3. There exists a permutation n of {0,1,...,p™~* — 1} 
such that S; = Sz). 
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PROOF. Let ind(1-- ip) = g, that is, (1 + ip): U = U;. We show 
first that for every l = 0,1,...,p* — 1, 


ind[1 + (i + Ipn-*)p| = j ten 


for some s, 0 € s < p* — 1; this is equivalent to showing that 


[1 + (i + lp-*)p| - U = Uj sm -& 


and allows us to define 2(7) = j, with S; = Sra). Indeed, let d be 
such that (1 + ?p)(1-- ip) = 1 (mod p"*!) and let | = l'p" with 
pfl,0<7r<k. We define b, 1 € b < p*" — 1 by the congruence 
l(1+i'p) =b (mod p*^"). Then 


[L+ (+ lp™™*)pl[1 + ip] = 1+ + ip)p" ^" 
= | +T( +ip)pr htt 
= 1 + bp hir") (mod rtl). 


But 
(1 NE bp BEE? =1 (mod pus. 
SO 


m—k+r 


1 a bp eR = ht@-Dp Py 


(mod p 
Taking s = t(p — 1)p* then 


1+ (i+ Ip" *)p = (1+ ip)h?” = Mr" (mod ri, 


as we needed to prove. 

We show that if i 4 i' then r(i) Z «(i'). If ? = 1+ip (mod p"*?), 
hd’ = 1+5p (mod p”+1) and j = j' (mod p"-*), let j' = j + sp-*. 
So 1+i'p = (14-ip)h*?" * (mod p"-*), hence bat" " € V. So sp"-* 
is a multiple of (p — 1), hence also of (p — 1)p"-^, that is, sp"-^ = 
t(p — 1)p"-*. Let h?-! = 1 + ap (mod p"*!). Then h*" = (14- 
ap)?" = 1 + atp"-** (mod p"-**!). Therefore 1 + i/p = (1+ 
ip)(1 + atp"-F*!) = 1 + (i + atp"-*)p (mod p"-**?). We conclude 
that i’ =i (mod p"-*), so i’ =i. 

We conclude therefore that 7 is a permutation with the required 
property. 


We now derive some formulas for Jacobi sums and Gauss periods, 
which are analogues to the ones already proved in Chapter IX, 82. 
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LEMMA 4.4. (1) If7 =1,... ,p"* —1 then T; = Tym_; (T; denotes 
the complex conjugate of T;); 
m+l es 
we E when p Jj; 
(2) 973 ={ 0 when p | j; and 
(3) 7; #0 if and only if p does not divide j. 


PROOF. (1) 


pi 


7; = Y CHARLES 


æ=1 


But ind,(—x) = ind;(—1)--ind;(x) (mod p"(n—1)) and ind,(—1) = 
Ip"(p — 1), so Gin (1) = 1. Hence 


m+1 
p 
des 
do ES SE ne Wi " = Tgn-j 
ve 
(2) We have 
prii prt 1 
Tjj = »» CJ inda (2)p* KS Gi inda (y) 7v 
oye zh 


parti pti] 


5 KS Ci (indn (2)—inda(y)) ge—y 


æ=1 
pix p/m 


For every x,y as above, let t£, 1 < t < p"*! — 1, be such that 
y = xt (mod p"^!), hence p |/ t and ind;(y s = ind,(x) + ind;(t) 
(mod p" (p — 1)). Then 


pti pti] 


TjTj = E > = —j ind; ( (05 æ(1—t) 


Pls Ps 
ptio pmtt_y 
= j ind, (t æ(1—t) 
s UR ES 
æ=1 
pit pfx 


Using Lemma 4.1, we have 


p-1 
zB URDE py Qr 
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where the first summand corresponds to t = 1 and the second sum- 
mation to the terms t = 1 + ip”; the other values of t give the sum 
0. By Lemma 4.2 


ind,(1+ ip") = iap"-! (mod p") 
(where p a), so 


p—1 
ny = p™(p— 1) mt D ON 60 
i=l 
_ { p”(p—1)+p” ap . stp yy; 


p”(p—-1)-p”(p-1)=0 ifp]|j. 
(3) This is obvious from (2). 


We note the following connection between the periods and the 
Jacobi sums: 


LEMMA 4.5. Consider the matriz Z = EE E pr_1 and the vec- 
tors 


To To 

T1 Th 
dy . , T] — 

Tpm 1 T]pm —1 


Then: 

(1) ZZ = p™I (I identity matrix), that is, 
5 GEÇTI = pta 
k=1 

(fori, j =0,1,... pe — 1). 


(2) ZT = py, that is, 


peal 


Kee 
j=0 


Gort =0,1,..., p” = 1). 
(3) T= Zn, that is, 


p"—1 


Kä Cony = Ti 
j=0 


(for 420,1, yp 1). 
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PROOF. (1) By Lemma 4.1(1), 


p" p"—1 "i xbv . bans d ; 
ikp—kj — (i-j)k _ J P when p" | i — j, that is, j = i, 
pin 


= = when j Æi. 


p”—1 p”—1 E iis p 
SS Qr — »» CH Y Coen tal pe 
j-0 0 nm 
pti pi 
cating 
= X p” 5 cx iind (x)) 
r—1 j=0 


But by Lemma 4.1(1) 


mM 


m : : 2: 
Y qi Ciim f P when ind,(x) = i (mod p"), 
pum 0 otherwise, 

hence the sum to be evaluated is equal to 


p-2 iA 
KS ez =n, 
a=0 


inda (x)=i (mod p") 


since ind} (x) = i (mod p"), 1 € x € p" — 1 is equivalent to x = 
RP" + (mod om". with 0 € a € p — 1. 

(3) From Zr = p"'r, by multiplication with Z we obtain p" = 
ZZr = p™ Zn hence T = Zn. 


As a corollary we obtain the vanishing of special sums of periods: 


LEMMA 4.6. Let p be an odd prime, m > 1, let 1 < k < m,i = 
0,1,..., p^-* — 1. Then 


CH 


5 Mitæpr-k = (0. 
x=0 
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PROOF. By Lemma 4.5(2): 


p'-1 p'-1p"-1 
m = —(it+ap™—*)j 
p Tic apr -* Tj 
x=0 


Il | 
E: 
Kei 
| 
ER 
fac? : 
| 
ME EE 
A Gei 
ge ER 
E 
M: 
a 
Bi 
8 
Ki 
3 
i 
EX 
Ne 


where € = (?" ^. Since € is a primitive root of 1 of order p*, by 
Lemma 4.1(1), 


-1 : : 
b» £9) = p" if p^ | J; 
0 otherwise. 


Hence 


æ=0 


p^-1 p"—1 i 
p" (x e = p" p» po = 0, 


by Lemma 4.4(3). 


For example, if m = 1 then k = 1 and 6 n; = 0, which was 
already known. If m = 2, k = 1 then 


p—1 
(4.9) Viet for i=0,1,...,p—1. 
j=0 


If m = 2, k = 2 then de Na = 0, as already known. If m = 3, k = 
1 then 


p—1 


(4.10) 3 Nitip =0 for i—0,1,...,p° —1. 


j=0 
If m = 3, k = 2 then 


p°—1 


(4.11) pao for i=0,1,...,p— 1, 
j=0 
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and finally, if m = 3, k = 3 then EE n; = 0, as already known. 
We may also evaluate the following sums of squares of periods: 


LEMMA 4.7. Let p be an odd prime, m > 1, let1l « k m, i— 
0,1,...,p"-* — 1. Then 


PROOF. Let U; = (a € (Z/p"**)* | ind, (a) = j (mod p™)} for 
j —0,1,... , p"(p — 1). Then U; is a coset modulo the subgroup U. 
By definition, 

Tj = 5 p” . 


O<s<(p-1)p™ 
SCH: 


We keep the notations preceding Lemma 4.3: 
S; = (Ui, Ui, pn-5, Ui 2pm-*, e s , Ui+ (pk -1)pm-5 }, 
S; = {(1 + ip) -U, [(1 + ip"^*)p] - U, 
s [L+ (i+ pep) pl U}, 


and let m be the permutation of {0,1,... ,p’~* — 1} such that S! = 
Sr (see Lemma 4.3). Then 


p*-1 


$i Uaec A. A. "Wm 
z-—0 


U;es; Hoen 


Thus we need only to evaluate the sums 


p^-1 p-1 2 
ip--zp" *1-*)g»"* 
He dees ) | 


REM a=1 
We have 
p*—1p-1p-1 
` m+1—-k m m 
S; = kee (a? + bP") 

z—0 a=1 b=1 

prp m m m-— 1 DN 1 
=) a?” +b?” +i(aP +b? ) æ(a? +b? ) 
3 p ¢ E H 

a=1 b=1 æ=0 

m+1—k m-1 


(so e" = 1), noting also that a?” = a? 


(mod p*), and similarly for b. 
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By Lemma 4.1, 


p-1 m "-— 
Si x Ue RE 
a=1 


1 


c S php - 1), 


and this concludes the proof. © 


In particular, taking k = m we have 


p"—1 


(4.12) Dm = pt) 


We specialize now the above results to the case where m = 1. Let 
a, l € a € p — 1 be defined as follows. If s = ind,(1 + p) then 
h^ = 1 + p (mod p?) hence h? = (1 +p)? = 1 (mod p°), hence 
p(p — 1) divides sp, so p — 1 divides s. We define a by a(p — 1) — s. 


LEMMA 4.8. Ifm = 1, a(p—1) = ind; (1--p) then ind, (1+cp) = —ca 
(mod p) for every c = 0,1,...,p— 1. 


PROOF. From h%P-1 = 1+p (mod p?) we have bit" = (1+p)° = 
1+cp (mod p°). Hence indz(1 + cp) = ca(p — 1) (mod p(p — 1)). In 
particular, ind; (1 + cp) = cap — ca = —ca (mod p). 


LEMMA 4.9. If m = 1, the Jacobi sums (for i = 0,1,...,p — 1) are 
given by 
qs 
where a(p — 1) = ind, (14 p). 
PROOF. 


p°—1 


Ti = 5 ginda (2) or, 


x=0 
By (4.3) we may write in a unique way 
zz h”(1+cp) (mod p°), 
where 0 < b,c € p— 1. Then by Lemma 4.8, ind, (x) = pb+ind;(1 + 
cp) = —ca (mod p). Therefore, 


p—1 


p-1i p-1 p—1 
Tj — 5 e 5 po kee = 5 p 5 ER e 
c=0 b=0 c=0 


b=0 
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noting that p? = ¢). Since ia # 0 (mod p) then (ia)? (mod p?) € 
| ) 


U. So there exists bọ, 1 < bo < p—1 such that (ia)? = h”™ (mod p? 
By Lemma 4.1(1), 


K eon) = D when b = bo, 
= ~ | 0 when b Z bo. 


We conclude that 


pb ia)? 
pro — off 


Tj = 


In analogy with (2B) we indicate an inductive expression for 
N (p, m; k) and F(p, m, k) in terms of the periods (see Klésgen (1970)): 


(4L) 
1 pere. 
(1) F(p,m,k) = -F(p,m — 1, k) + — »» në; and 
p p i=0 
1 gp 
i=0 
PROOF. (1) By Lemma 4.1(1), we have 
m+1 1 


p-l pl p—1 p 
^ 


p"! F(p, m, k) Ns ue ue 


241—125—1 tpt. y=0 
p"*!—1 p-1 T. p-1 
= pn SE ` p 
y=0 "zl Tk=1l 
pti 1 /p-1 E: k 
= yc? 
= p 
y=0 z-—l 
pti 1 /p-1 k pti 1 /p-1 k 
a x? 
zm. dO ques e pe gr 
y=0 æ=1 y=0 æ=1 
ply ply 
p"-1 /p-1 ce k p" ^-1 /p-1 k 
= tz? yz? 
De d l9 
t=0 æ=1 y=0 æ=1 
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since p? = C and z?" = a?" (mod p"). Since 


p"—1 /p-1 ^N k 
p" F(p,m —1,k) = 5 or | H 


t=0 z-—1 


by the same computation as above, then 
hi 1 /p-1 E k 
p"! F(p,m,k) = p" F(p,m — 1, k) + SE i 


Each y, 0 € y € p"*! — 1, p Yy, may be uniquely written in the form 
y = hê (mod p"*?), with 0 < i € p"(p — 1). Also, each x?” (with 
1 < g< p—1) may be written uniquely as x?” = h^?" (mod p"*!) 
with 0 € c € p — 2. Hence 


E 1 Ei ) p" (p-1)-1 D T ) 
yr?" p i+cp 


ZE 


I 
L9 


I 
1 


i=0 
p"—1 
= (p-1) don 
i=0 


Thus 


D 


(2) Dividing by p — 1 and taking E into esch we deduce 
the recurrence relation for N(p,m;k). 


As a corollary, it follows from (4F) that 


(4.13) F(p) = | ma 
p p i=0 
E ee 
(4.14) N(p) = Tä, 
p D" i-o 


Let S(p”, k) = Ya [ni(p,m)]*. From the recurrence formulas in 
(4L) we obtain the expressions: 
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(4M) 
(1) F(p.m,k) = (DD + (DE SQ E) + S(p", k) + 
-- + S(p, k)]. 
(2) N(p,m, k) = SR ee EE k)+ 
-- + S(p, k)]. 
PROOF. (1) We have 
1 1 
F(p,m, k) = 5 Um m—1,k)+ ee” , k), 


1 1 pd zi 
—F(p,m-—1,k) = —F (p,m — 2, k) + S(p" ,k), 
S (p ) P ( ) E ( ) 


ES 1 
— F(p,1,k) = F(p,0, k) + Tan, k). 
p p" p 


Adding these equalities and taking into account (4G), we deduce 
that 


Ein = (=D  CD* + Sq") + S07, k) 


T + S(p, k)]. 


(2) This formula is obtained from the preceding one by dividing 
by p — 1. 


Taking k = 2 in the above formula and noting that N(p,m,2) — 1 
(trivially) then (4M) yields again the relation of Lemma 4.7 for k — 
m. Indeed, if m — 1, 


1— N(p,1,2) = Sle- 1)+ 1 + S(p,2)], 


hence 
p-1 


S(p,2) = So Ine, DP = (p — 1). 
j-0 
Assuming by induction that for r « m we have 


p'-1 


S(p^,2) = V [n(p,r)? = p'(p — 1), 


j=0 
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then 
1 = N(p,m,2) 
1 
= seule E ESUS hp E 


Hence S(p”, k) = p"(p — 1). So S(p", kl is an integer which is a 
multiple of p". 

For the next result, which is about sums of powers of the Gaussian 
periods, we shall require a lemma about the q-adic values of products 
of factorials. 


LEMMA 4.10. Let q be a prime, s > 1, u > 1 and let ri,r2,... ,r, be 
integers greater than 0 such that q* = rı +7r2+---+7,. Then 


valrai! ra! pl) € qu,(q^-!!), 


and qu = q, HS = -e = ra = qê then valgt! -q5 = 
av (a1). 


PROOF. If s = 1 the statement is trivial (since u > 1). We proceed 
by induction on s. 

We have [ri/q] + --- + [r./q] ol, so for some integer ro’ > 0 
we have rj + [ri/q] +---+ [r./q] = q?^^!. By induction, 


vu, [eT son 


By Chapter II, (1A), if t > 1 is an integer then 


vo-[DJ) se 


and if x > 0 is any real number, a > 1 any integer, then [x/a] = 


[x] /a], so 
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SE. bat) 


i ii q d 
< q*! t wir, KH D 
< q? ! + quy(qr- Si 
eg AT oe ee) 
= q(q^? EE Mo + q + 1) E oul TI. 


The last assertion is of course trivial. 


Now we may derive the following properties of the sums S(p”, k): 


(4N) 
(1) S(p”, k) = p"[pN (p, m, k)—N(p, m—1, k)] = p" |F (p,m, k)— 
F*(p, m, k)]. 
(2) If q is a prime, k = q^, then S(p™,q*) is a multiple of q. 


PROOF. (1) By (3L) and (4H), 


S(p”, k) = p" [pN (p, m, k) = N(p,m = 1,k)] 
= p" [(p = 1)N(p, m, k) Ce F"(p,m, k)] 
= p" [F(p, m, k) — F*(p,m, k)). 


(2) By virtue of (1) we may assume q Z p and it suffices to show 
that q divides F(p,m, q^) and F*(p,m,q*) = F(p,m,q*;c), where 
l<c<p-le=q SE (by (4H)). Let z1,25,...,24 € Z 
be : such that zf + de +- +r} =0 (mod p™*'), respectively, 
ge quaB shes kg c on p" where 1 < x; < p"!-1 
for à = 1,... ER ke q Each k-tuple obtained by a permutation 
from (2 ,%2,... , Zp) is still a solution of the congruence. So the set 
of solutions is organized into disjoint classes of solutions equivalent 
under permutation. Therefore it suffices to show that the number of 
solutions equivalent to any given one is a multiple of q. 


Let the k-tuple (£1, £2,... ,æ},) have exactly u distinct compo- 
nents, repeated, respectively, ri, r2,... ,r,, times (with r; > 1). We 
note that u Æ 1; otherwise from xz, = £2 = ++- = x, we deduce 


qx? = 0, EE (mod p), which is impossible. It follows 
that rı < k,... ras 
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The number of solutions obtained by permutation from (2, £2, 
. ,£,) is equal to K!/(ri! rs! r,!). So we need to prove that q 
divides this number. We have 


ug(k!) = ge aq? gH. 


Since rı + r2 +: +7, = qê and u > 1 then w(rilr2l-.-r,l) 

iu! qq" ere a a cae e e 
Therefore the q-adic valuation of k!/(ri! rə! - - - r,,!) is at least equal 

to 1, which was to be proved. 


IA 


We now indicate an upper bound for N(p) in terms of the periods. 
In view of (4.13) we are led to find an upper bound for the sum 
$2? nj. For this purpose we establish the following lemma: 


LEMMA 4.11. Let n > 3, let f be the function of n real variables 


TL 


TE aces 273 = Soy? 
i=1 
defined on the set D of all points (y1,... , Yn) such that 37, yi = 0 
and $74 yi = n(n — 1). 
(1) If (yy, ... Yn) is a point where the function assumes a mazi- 
mum or a minimum, then there exists an integer T, 1 € T < 
n — 1, such that (up to a permutation of {1,... ,n}) 


n— 1 
yi UT (n ) T(n-T)’ 
n— 1 


Let yT be the point with the above coordinates. 

(2) fy") = nln — Dn = 27) (n — D/(T(n — TJ). FT) = 
—f(yT), f(y7) > 0 for1 € T < (n — 1)/2. 

(3) If T = 1 then f(y?) = n(n — 1)(n — 2) is the absolute mazi- 
mum of f on the given domain D. 

(4) If1 <T X n—1 then f(y") is the absolute maximum of 
f on the points y = (y1,... , Yn) € D such that y; € (n — 


T)v (n — 1)/(T(n — T)). 
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PROOF. (1) To find the points of maximum or minimum of the 
function F on the domain D, we employ the method of Lagrange 
multipliers. Let A, u be parameters to be determined and 


Fun... Ym) = TU, a+ (Su) ET 


i=1 i=1 
If (y1,... ,Yn) is an extreme point then (0F/Oy:)(y1,... ;,Yn) = 0 
(for à = 1,... ,n), that is, 
(4.15) 3y? +At+2uy,=0 (fori=1,...,n). 
Adding up these relations, we obtain 3n(n — 1) + nÀ = 0, hence 


àA = —3(n—1). Substituting this value in the above condition (4.15), 
we have 3y? + 2uy; — 3(n — 1) = 0, hence 


—pnzi vu + O(n = 1) 


4.16 = 
(4.16) y 5 
Since there are only two possible values for the coordinates then, up 
to a permutation of {1,... ,n}, there exists an integer T, 0 € T <n, 
such that 
—u + yu? + 9(n — 1) 
Wa = yr = 
3 
—u — Vu? -9(n—1 
YT+1 5 S Yn = l ), 


3 


We note that not all coordinates can be equal since 577 , y; = 0. 
Hence 1 € T € n — 1. Moreover, 


" 
0—- -S3(T*n-T)*gy p? t9(n- D(T n T), 
hence 
nu = (2T — n) Vi?  9(n — 1); 
therefore 
ny? = (2T — n}? lu? + 9(n — 1)] 
and finally 
| 3(2T — n) n—i 
2 T(n — T)' 
— 1)n? 
"Nd ja )n 


— 4T(n - T) 
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Substituting into (4.16) we obtain 


n—1 
(n — T) Tin T or 
(4.17) yi = "uc 
Ex T(n— T) 


(2) For the point y^ with above coordinates, we have 


i —1 -> 
2/4 c ner ` DVT L 
i=1 T(n-T 
ER a 
(n DVT 
= 
n-—i 


— n(n 


Let 


F(t) = n(n SE 


for 0 < t < n. Then f(n —t) = —f(t); if 0 « t< n/2 then f(t) > 0 
and f(y^) = f(T) for T = 1,2,...,n — 1. If0 «t € (n — 1)/2 then 
t(n — t) € (t -- 1)(n — t — 1), as seen at once. Hence 


n — 2t 3 n-—2t—2 
vt(n—-t) V(t+1)(n-t-1) 
and therefore f(y’) > f(y") > 0 for T = 1,2,... ,[n/2] — 1. 


(3) For T = 1 we have f(y') = n(n—1)(n—2). Since the function 
is continuous and defined on a closed and bounded domain, it has a 
maximum and a minimum. By (1) and (2) it follows that f has the 
absolute maximum at yt. 

(4 Let 1 € T € n — 1 and let Dr consist of those points of 
D with coordinates y; un y; < (n —T)y(n — 1)/(T(n - T)). 
If 1 < T <n-1 wen y! € Dr exactly when T < T”. Hence 
from f(y 1) > f(yT) > 0 for 2 € T' € [n/2] and f(yT) < 0 for 
T' > [n/2], we conclude that f(y") is the absolute maximum of f 
on the domain Dr. 
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With the above notations we have: 


(40) Let M = maxim; | i = 0,...,p — 1}, let T be the largest 


integer such that 
| p-1 
M < T j 
p—-1 
N(p) « 1+ (p — 2T) T(p- Ty 


PROOF. Since 3774 9; = 0 then 0 < M. By (4.6), M < p— 1. The 
1 
t) 


function 


is decreasing, f(1) = p—1, f(p-1) = Se mm there exists the largest 
integer T, 1 <T < "à — 1, SCH SE 


M < (p- T) oy 


p—1 p—1 
Yim=0, Sin? =p- 
i=0 i=0 


as seen before, then (m0,m,... , 7, 1) € Dr. By Lemma 4.11, f(yf) 
is the absolute maximum of f on Dr and we have 


Then 


Since 


p—1 E 


ye = f(n,- -- 1) fly") =p =1)(p—2T) 50-0) 


By (4.13), 
IG, Jue e 
N(p = — +; TJ; 
(p) S F2 
Sp? 0 l(p-2T) |, pad 
p p T(p-T) 
p—l 
<1+(p-2T 
) T(p-T) 
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We note that if 


p—1 
T(p- T)’ 


M = (p-T) 


then N(p) < 1+ M. Indeed, in the above proof we obtained 


roae Eee ped 


p p T(p — T) 
—2 — 1p- 2T 

m. p—lp M 
p p p-T 

<1+M. 


Now we study the asymptotic behavior of F(p, m, k), N(p,m, k) 
when k tends to infinity. 


(4P) 
: N(pm,k) ` 
eR QD Dep T 
PROOF. From (4M) we have 
N (p, m, k) Es (-1)* Se S(p, k) 
(p- Eent 1t cay t 07D > (ped 


"3 


etie @ D» Dy 


j=l i=1 


Since |mi(p, j)| < p — 1, by (4.6) it follows that 


im (ged) =o, 


hence Dm, N (p, m, k)/((p — 1) 1/p"*!) = 1. 


We consider the existence of p-adic solutions of a certain congru- 
ence. Let U, denote the multiplicative group of (p — 1)th roots of 1 
in the ring Z, of p-adic integers. We observe: 
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(4Q) 
(1) There exists an integer mo = mo(k,p) > 1 such that for 
every m > mo we have N(p,m,k) = N(p, mo, k). Let N(p, k) 
denote this number; N (p, k) > 0. 
(2) N(p,k) is the number of solutions of the equation 1 + X3 + 
4 Xy = 0 by elements in Û,. 


PROOF. (1) We have seen in (4H) that 0 € N(p,m,k) € N(p,m— 
1, k). Hence there exists mo with the property indicated. 
(2) Let m > mo and let 


k 
1+ Se =0 (mod p™t'), 

i=2 
where p J/z;. Since QUU mum (mod p"*7) then 


1+ Kë ae =0 (mod p™*?), 


SO 
k 


1+ 5 xip! = ap"! (mod p"*?) 
i=2 
for some a, 0 <a < p—1. Since m +1 > mo, by (4H), F*(p,m + 
1,k) = 0; therefore necessarily a = 0, so 


1+5 x" =0 (mod p”*?). 


In this way we have established the mapping 


(rz mod p™*",... , £y mod p 


— (x2 mod p™*?,... , 2, mod p 


a 


perm) 


from the set Sm of solutions of 
LA qossRX? =O" (mod pt) 


to the corresponding set Sm+1. This is clearly an injective mapping. 

Starting therefore with any (£ mod p"**!,... , a, mod p™+!) € 
Smo we obtain the sequences (Qm,i)m>mo, where Ami = a” (for 
i = 2,...,k). Since ae = a (mod p™*') then o441; = Omi 
(mod p"), so the sequence (0o; ;),, converges p-adically. 
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Let a; = lim s@x Then 1 + 05-F =F a, = 0 in Des: Since 
Pa = 1 (mod p") for m > mo then a? ! = 1, so o; € U, 


O^ i H 
and (oz... oul € S, the set of solutions in Ü, of the equation 
1+ X2+---+X; = 0. So we have an injective mapping o from Smo 
into $. 

On the other hand, if a; € Us a; (Ym,i)m>0 (for i = 2,... VE) 
and 1 +az +: +a, = 0, since oi ^ — 1 then y^; = 1 (mod p") 


(for i = 2,... , k and all m sufficiently large; we may take m > mo). 
p (mod pres) 


Thus Ym, = Tmi and 


1+ e Tec gy — (0 (mod qi 
In particular, 
lama esca =0 (mod sr, 


SO (Z,,23,... ,Tmk) € Smo- It is now immediate that the mapping c 
associates with this solution in Sm, the given solution in S, thus c 


is surjective. We conclude that the number of elements in $ is equal 
to N(p, k) = € S$,4,- 


As a corollary, we have: 


(4R) If p = 1 (mod k) then there exist (p — 1)th roots of 1, 
Q5, ... , Oy € Up, such that 1 + az +: + ax = 0. 


PROOF. By (4A), for every m > 1 the congruence 1 + AR Cod 
X?" =0 (mod p"*!) has a nontrivial solution. In particular, N (p, k) 
> 1, hence by (4Q), there exist az,... , az € Û, such that 1 + o3 + 
+a, =0. D 


We reproduce now the following tables, computed by Klósgen, 
which give N(p,m,k) for low values of the argument. Table for 
N (p, 1, k): 
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3 4 5 6 7 
51/10 9 0 100 39 
7 12 15 60 340 1680 
11/0 31 24 1600 5250 
13|2 33 200 2260 21630 
1710 57 140 6220 50120 
192 51 390 6880 101430 


Table for N(p,2,k) (italicized are the values which are not the 
same as in the table above): 


3 4 5 6 7 
51/10 9 0 100 0 
7 12 15 60 340 1680 
1110 27 24 1090 2520 
1312 33 150 1930 15540 
1710 45 0 3160 945 
1912 51 300 4600 44520 


To conclude this section, we shall discuss a heuristic method to 
indicate the probability for the congruence 


1-Y?-Z?z0 (mod p°) 


to have a given number of equivalent classes of nontrivial solutions. 
We exclude also the cyclic solutions from these considerations. 
Let 1 € a € p — 2 and let 


1 
M, = {a mod p, — mod p, —(1 + a) mod p, — mod p, 
a 


1 
(1 + a) 


——— mod p]. 


M, consists of six distinct elements (all different from 0, — 1), except 
in the following cases: 
(a) a = 1,p—2or (p—1)/2; then M, = {1 mod p,(p — 2) 
mod p, (p — 1)/2 mod p}; and 
(b) a Z1, a? =1 (mod p); then M = {a mod p,a? mod p}. 
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This latter case happens if and only if p=1 (mod 6). 

Thus, we have a partition of {1,2,... ,p — 2} into disjoint classes. 
If p = 6n +1, the number of such classes M, (with more than 
two elements) is equal to n. Indeed, if p = 6n + 1, this number 
is 1 + (p— 2 — 2 — 3)/6 = n. If p = 6n — 1 then the number is 
again 1 + (p — 2 — 3)/6 = n. In each class M, (with more than two 
elements) let à be the smallest integer, 1 € à < p — 2, such that 
à € M,. Clearly à € (p — 1)/2. 

If 1-- y? +2? =0 (mod p?) then necessarily z = —(1+y) (mod p). 
With this solution we associate ÿ, and we note that (1--g)^ —1— g? = 
0 (mod p?). If 1 + y? +z” = 0 (mod p?), this is an equivalent 
solution to the above one, if and only if y” mod p € M,, that is, 
y = ÿ. 

By Fermat’s little theorem, if t > 1 then (1 +t) —1—t* =0 
(mod p) hence (1 +t)? — 1 — t? = t(p)p (mod p?) where 0 < t(p) < 
p — 1. So the solutions of the congruence correspond to the integers 
t, 1 <t< (p—1)/2 such that t(p) = 0. 

We consider the sequence (t(p))1<(p—1)/2- 


(4S) | Assuming that the sequences (t(p))t<(p—1y/2 are random, for 
all primes p = 6n + 1 we have: 
(1) the probability that 1 + Y? + Z? =0 (mod p?) have only the 
trivial or cyclic solution is equal to ((p — 1)/p)^. 
(2) The probability that the above congruence have r (nontrivial, 
noncyclic) equivalence classes of solutions is equal to 


1 fn (? — y 
p” \r p | 
(3) The density of primes for which there are exactly r (nontrivial, 
noncyclic) equivalence classes of solutions is equal to 


1 1 
rl 6" Ve 


PROOF. (1) As already indicated, there are n equivalence classes in 
{1,2,...,p — 2} consisting of at least three elements. Let us denote 
them by Mz, Mzz, Sao , Ms,. 

Mz, consists of solutions of (1 + t)? = 1 + t? (mod p?) exactly 
when i;(p) = 0. Since the sequence (%;(p))z,<(p—1)/2 is random, the 
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probability is 1/p. Hence, the probability that none of the n classes 
M;,(p) consist of solutions is (1 — 1/p)". 

(2) In the same way the probability that r among the n classes 
M, consist of solutions is 


(3) The density in question is equal to 


S 
Il 
Te 
SE 
IIS 
E DS 
E 
Dow à 
ATN 
Ki 
S|] | 
= 
NE 
3 
1 


| 
E 


If p = 6n + 1 then 


1 6n \or/6 1 1 
DS == fer (- z | 
Grr! noo (= T z) Grrl We 


Similarly, if p = 6n — 1 then 


p 1 S =e y (1 1 » 
67r! n— V 6n— 1 6n—1 


1 1 
6"r! Je 


Klósgen computed the solutions of the congruence 1+Y?+Z? z 0 
(mod p?) for all primes p < 20 000. 

If 0 € r let v; (respectively, v;) be the number of primes p < 
20000, p = 1 (mod 6) (respectively, p = —1 (mod 6)) for which the 
above congruence has exactly r (nontrivial, noncyclic) equivalence 
classes of solutions. 

There are 1124 primes p such that p = 1 (mod 6) and p « 20000. 

Klósgen found that 


vf =970,  wi/1124— 86.3096, probability 84.35%. 
erede vy /1124 = 12.8196, probability 14.11%. 
Us = 9, v3 /1124 = 0.80%, probability 1.18%. 
ve v3 /1124 = 0.09%, probability 0.07%. 
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In a similar way for p = —1 (mod 6), p < 20000: 


vy = 957, vp /1136 = 84.24%, probability 84.35%. 
vj = 166, v; /1136 = 14.61%, probability 14.11%. 
v3 = 13, v; /1136 = 1.15%, probability 1.18%. 
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XI 
Epilogue 


This book about Fermat’s last theorem was written for the enjoy- 
ment of amateurs. Most of the proofs are given in full detail and use 
only elementary and easily understandable methods. For this reason, 
it was imperative to exclude developments depending on the study 
of ideals of number fields or on more sophisticated theories. How- 
ever, in this final part we indicate the more important achievements 
which could not be dealt with using elementary methods. We also 
give a succinct description of the approach to the proof of Fermat’s 
last theorem. To help the reader who wants to know more about 
these matters, a bibliography of important articles is also included. 


XL1. Attempts 


In this section, we give a brief overview of various approaches to the 
proof of Fermat’s last theorem. They were not quite successful but 
should not be dismissed. At their time, these results raised hopes for 
the proof of Fermat’s last theorem and led to new research problems 
of independent interest. 
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A. The Theorem of Kummer. 
In 1847, Kummer proved the following important theorem: 


If p > 2 is a regular prime, then Fermat’s last theorem is true for 
the exponent p. 


The concept of a regular prime needs explanation. It may be 
defined in terms of the class number of cyclotomic fields or by means 
of Bernoulli numbers. 

The Bernoulli numbers Bo, B1, B2,... are defined recursively: 


Bo=1 


and for n > 1, 


1 1 1 


Thus B, = =, B = e B, = 0,.... It is easily seen that Banz 
0 for all k > 1. The prime number p is regular if p does not divide the 
numerators of the Bernoulli numbers Bo, B4,... ,B, 5, B,_3. Let p 
be an odd prime, let 


Cp = cos(27/p) + isin(27/p) 


be a primitive pth root of 1. Let Q(G,) be the pth cyclotomic field; 
it consists of all complex numbers of the form 


To + rıCp Ra N 


with ro,71,...,7,-2 € Q. The class number h, of Q(G,) is a certain 
positive integer attached to Q(G,); it is the number of classes of 
ideals of Q(¢,), but we shall not explain these concepts any further 
(see any book on the theory of algebraic numbers, like the one by 
Borevich and Shafarevich (1966), or even this author's own book 
(1999)). Kummer showed that the prime p is regular if and only if 
p does not divide hy. 

'The smallest irregular prime is 37. It is known that there are in- 
finitely many irregular primes. On the other hand, it is conjectured, 
but it has never been proved, that there are infinitely many regular 
primes. 

'The method of Kummer could be extended to deal also with many 
irregular primes. However, with these methods it was never possible 
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to establish that Fermat’s last theorem is true for infinitely many 
prime exponents. 

To determine if a prime p is regular is not a simple matter as 
soon as p is large, because the numerators of the Bernoulli numbers 
become very large. Noting that what is required is to ascertain 
that the exponent p does not divide the numerators of B5,... , Bj. 3 
(rather than calculating their numerators), Lehmer, Lehmer , and 
Vandiver gave a criterion which was possible to implement for actual 
calculations. In this way it was shown (at a time when the proof of 
FLT was not yet discovered) that FLT is true for all prime exponents 
up to 4 x 109 (see Buhler et al. (1993)). 
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B. The Theorem of Wieferich. 


In 1909, Wieferich proved: 


If the first case of FLT is false for the exponent p then 
2-1 = 1 mod p’. 


This is a criterion involving only the exponent p and none of the 
hypothetical nonzero solutions x,y,z of X? + Y? = ZP. It was 
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immediately noted that no very small prime p satisfies the above 
congruence. Before the age of computers Meissner proved in 1913 
that p = 1093 is the smallest prime with the above-mentioned prop- 
erty. It is difficult to imagine the amount of calculations which was 
required. A further example, p = 3511, was found by Beeger in 
1921. Further computations by Lehmer, Keller, Clark and lately by 
Crandall, Dilcher, and Pomerance have shown that no other prime 
p < 4 x 10” satisfies the congruence. 

Other criteria of a similar kind were discovered by Mirimanoff, 
Vandiver, Frobenius, Pollaczek, Rosser, and Granville and Monagan, 
namely, 


If the first case of FLT is false for the exponent p then 
PTE = 1 mod p’ 


for all primes l < 89. 


A clever combinatorial combination of these criteria, by Gunder- 
son and Coppersmith, followed by extensive calculations (Granville 
and Monagan, Tanner and Wagstaff), allowed us to show that the 
first case of FLT is true for every exponent p < 6.93 x 101”. All this 
was done before the discovery of the proof of FLT for all exponents. 
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C. The First Case of Fermat's Last Theorem for Infinitely 
Many Prime Exponents. 


Using methods from sieve theory, Adleman, Heath-Brown, and Fou- 
vry proved in 1985: 


There exists an infinite set S of prime numbers, such that the first 
case of Fermat’s last theorem is true for every exponent p € S. 


A stronger result, valid not only for the first case, could not be 
established with the same methods. This theorem represented an 
important advance at that time. The method of proof was inspired 
from the old ideas of Sophie Germain and was connected with the es- 
timation of the size of the smallest prime in arithmetic progressions; 
the use of refined sieve theory was essential. 

'The infinite set S, guaranteed by the theorem, is not effectively 
defined, so it is not possible, with the method of the proof, to deduce 
for any given p that the first case of FLT holds for p. 

Once again this substantial theorem is obsolete, due to the proof 
of FLT for all exponents n > 2. 
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D. The Theorem of Faltings. 


Mordell observed and conjectured that irreducible curves defined by 
homogeneous polynomials of high degree in three variables with ra- 
tional coefficients should have only finitely many rational points, 
when they have few singularities, all of lower order. The exact con- 
jecture is expressed in terms of the genus of the curve, a concept 
which will not be explained here. In a remarkable paper Faltings 
proved, among many other theorems, Mordell's conjecture. In the 
particular case of Fermat's equation his result becomes: 
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For every n > 3, there exist at most finitely many triples (x, y, z) 
where x,y,z are integers, not all equal to 0 and such that gcd(x, y, z) 
= ] and x" + y^ = z^. 


Despite its importance, this result could not lead to the proof of 
FLT. However it was used, independently by Granville and Heath- 
Brown, to deduce that the set of exponents n > 3 for which FLT is 
true has density one. This method of Granville or Heath-Brown is 
also applicable to a very wide class of exponential diophantine equa- 
tions (see Ribenboim, 1993), the conclusion being the zero density 
for the exponents for which the equations have nontrivial solution. 
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E. The (abc) Conjecture. 


The (abc) conjecture, attributed to Masser and Oesterlé, was in- 
spired by a result about polynomials, due to Mason. The conjecture 
is stated as follows: 
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For any e > 0 there exists a number C(e) > 0 such that if a,b,c 
are integers, 1<a<b<c, with c = a +b and gcd(a,b,c) = 1 then 
c < C(e)r'^*, where r is the product of the distinct prime factors of 
abc. 


Intuitively, as an example, if a = 2", b = 3" (with m,n large) 
then c = a + b is large, so the conjecture states that c must have a 
large prime factor or a large number of prime factors, so that r is 
large. 

It is easy to show that the (abc) conjecture implies: 


FLT is true for all sufficiently large exponents. 


Indeed, let n > 3 and assume that x,y,z are positive integers, 
such that gcd(z, y, zl = 1 and a” + y^ = z". Let e = 1, so by the 
(abc) conjecture z” < C(1)r?? where 


r= 2e re 


Plenum zm plzyz 


so 2” < C(4)z°”?. This shows that there exists no such that n < no, 
in other words, FLT is true for every exponent n > no, or in short, 
FLT is asymptotically true. 

The (abc) conjecture is known to imply many other statements in 
number theory which have never been proved, as well as Mordell's 
conjecture which was proved by Faltings. The proof of the (abc) 
conjecture should be very difficult and this is presently the object of 
intense research. 


Bibliography 


1984 Mason, R.C., Diophantine Equations over Function Fields, 
London Math. Soc. Lecture Notes Ser., No. 96, Cambridge 
University Press, Cambridge, 1984. 

1985 Masser, D.W., Some open problems, Symp. Analytic Number 
Theory, Imperial College, London, 1985 (unpublished). 

1988 Oesterlé, J., Nouvelles approches au théoréme de Fermat, 
Sém. Bourbaki, 40ème année, No. 694, Février 1988, 1987— 
1988. 

1990 Masser, D.W., More on a conjecture of Szpiro, Astérisque, 
183 (1990), 19-24. 


366 XI. Epilogue 


1991 Elkies, N.D., ABC implies Mordell, Duke Math. J., Intern. 
Math. Res. Notes, 7 (1991), 99-109. 


XL2. Victory, or the Second Death of Fermat 


Mathematicians have the obligation of solving problems. When a 
long-sought proof, like the one for Fermat’s last theorem, is finally 
discovered, it is the moment of crying VICTORY. 

On June 23, 1993, in the third of his lectures at the Newton Insti- 
tute in Cambridge, England, Wiles announced the proof of Fermat’s 
last theorem. His manuscript, scrutinized by various experts, re- 
vealed flaws which needed corrections. Undeterred and with the help 
of Taylor, Wiles found a way out of the difficulties and in October 
1994, he made public two manuscripts, one co-authored by Taylor. 
They contain the proof of the conjecture of Shimura-Taniyama, for 
the case of semistable elliptic curves. According to the previous work 
of Ribet this entails that FLT is true. For most mathematicians this 
represents the end of the saga. Wiles deserves the admiration of all 
mathematicians for his achievement. The method used has already 
been applied to other diophantine equations. Wiles' work was the 
final step in a new strategy which will be evoked shortly. 

There are some mathematicians who are not satisfied with the 
method of proof using elliptic curves and modular forms, considered 
— perhaps wrongly? or rightly? — to be extraneous to the problem. 
It is a legitimate task to try to find another, simpler, proof of FLT. 
But the solution of Fermat's problem also harbors a negative aspect 
and a tear of regret is unavoidable, because mathematicians also 
like unsolved problems to stimulate their research, just like night 
butterflies are attracted by intense sources of light. 'l'he study of 
Fermat’s theorem led to the creation of the theory of algebraic num- 
bers, in the same way as the study of quadratic fields was prompted 
by Gauss! theory of quadratic forms. The branch of mathematics 
which is the confluence of number theory and algebraic geometry, 
called Arithmetic Algebraic Geometry, developed not only by its in- 
ternal problems, but also in view of solving Fermat's last theorem. 
The attempts to prove Fermat’s theorem, the old and the new, show 
a myriad of interesting ideas in many directions of number theory, 
by illustrious names. Will this stimulation disappear now that FLT 
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is proved? Not at all. Variants of the problem, generalizations to 
higher dimensions, will continue tantalizing mathematicians. So, 
we celebrate this striking victory and admire our colleagues who, 
through effort and ingenuity, succeeded in solving the problem. 
The proof of Fermat’s last theorem must be indirect. We assume 
that there exists n > 3 and positive integers a, b, c such that a? +b” = 
c”. The aim is to deduce a statement which is known to be false. 
No contradictions were found with statements in elementary number 
theory, nor with statements about number fields, nor for that matter, 
for any other statements until the expression of FLT in terms of 
elliptic curves. The proof of FLT was established with the following 
steps: 
(I) To associate an elliptic curve to a hypothetical nontrivial 
solution of Fermat's equation, with arbitrary exponent n > 5. 
(II) To obtain a contradiction to the assumption of validity of a 
certain conjecture about elliptic curves and modular forms. 
(III) To prove the validity of the conjecture. 


'These steps require sophisticated concepts and theories, far be- 
yond the level of this book and the knowledge usually expected from 
amateurs — and also from professional mathematicians working in 
other disciplines. My task is difficult if not hopeless. What will 
follow is simple-minded and superficial, but still mysterious and per- 
haps out of grasp for anyone who is not yet familiar with the concepts 
involved. The key notions needed are elliptic curves, modular forms, 
and Galois representations. 


A. The Frey Curves. 


For relatively prime positive integers A, B and A divisible by 16, 
Frey considered the elliptic curve of equation 


(2.1) Y? = X(X -A(X +B) 


(see Chapter VIII, 81, (A6)) and studied its properties. 

If Fermat's last theorem is false for the prime exponent q > 5, 
let a,b,c be positive pairwise relatively prime integers, with a even, 
such that a? + b? = «1. Let A = a’, B = bg The associated 
Frey curve displayed properties in sharp contrast with those of other 
elliptic curves. Frey became convinced that such a situation was not 
possible and envisioned a method to derive a contradiction with the, 
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by then well-known, conjecture of Shimura-Taniyama (see below). 
But there were serious obstacles to overcome, which would require 
many years of work (see below). 

Here are some propeties of the Frey curves. The minimal discrim- 
inant of the Frey curve is 


| G^? (af +b)? (abc)? 


28 28 
Since A # 0, the curve is nonsingular, so it is an elliptic curve. 
For every prime p not dividing A, we consider the congruence 


(2.2) Y?’ = X(X — a*)(X 4- b*) (mod p). 


It defines a curve in the two-dimensional space over the finite field F,. 
Since p does not divide A, the curve is nonsingular, so it is an elliptic 
curve. On the other hand, if p divides A, the curve is singular. The 
type of singularities is encoded in the invariant called the conductor. 
'The primes p dividing the conductor are exactly those dividing the 
discriminant, that is, the primes p for which the curve in F, x F, has 
singularity. The exponent of p indicates the type of singularity. In 
the present case, where the singularities are nodes, the conductor N 
is square-free, so it is equal to 


N=|[p. 


plA 


Elliptic curves with square-free conductor are said to be semistable. 
Thus, Frey curves are semistable. As it was known, if Fermat’s last 
theorem is assumed false for the prime exponent q, then q has to be 
very large; moreover, since Fermat’s equation is homogeneous, the 
discriminant is a power — and this seemed unlikely to be possible. 
We shall count the number of points of Frey’s curve modulo p (for 
every p not dividing A). To this count we add 1, which corresponds 
to the point at infinity in the associated projective curve. Let vp 
be the number of points and let a, = p + 1 — v, (a, need not be 
positive). We pause to recall that in Chapter I, 81, we studied the 
Pythagorean equation X?+Y? = 1 modulo all odd primes; we proved 
that the numbers a, defined there are easily determined by a simple 
congruence for the prime p. Similar considerations are important for 
all elliptic curves (not only for Frey curves). The discriminant, the 
conductor, and the integers a, (for p not dividing the discriminant) 
are defined and studied in the same spirit. Elliptic curves which 
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can be given by an equation with coefficients in Q are said to be 
defined over Q. The rule for determination of the integers a, involves 
modular forms. 


B. Modular Forms and the Conjecture of Shimura-Tani- 
yama. 


Let N > 1 be an integer. Let l'o(/:N) be the set of all 2 x 2 ma- 


trices 
a b 
c d?’ 


where a, b, c, d are integers, N divides c and ad — bc = 1. To(N) isa 
multiplicative group called the congruence group of level N. Let H 
denote the upper half-plane, that is, H = {z=x+iyeEC|y> 0]. 
To(N) acts on H as follows: 


a b az +b 
(2.3) » le eg 


for all matrices of Fo(N) and z € H. Associated to the group 
To(N) there are finitely many special points (which we do not define 
here), called cusps; these are the point at infinity of the half-line 
{iy | y > 0} and other points in H U Q (when N > 1). 

A modular form of level N (and weight 2 — the only ones we wish 
to consider) is a map f from H* = H U {cusps of D9(N)) to C such 
that: 


(i) for all Se 1) € l'o(N) and z € H*: 
(24) s (35) = e+ 16 


(ii) f is holomorphic at every point of H* (this requires an ap- 
propriate definition at the cusps). 

A modular form which vanishes at all cusps is called a cusp form. 

The theory of modular forms is very rich. Here are some relevant 
facts (for which we give no hint of proof): 


(1) The set M2(N) of modular forms of level N and weight 2 is a 
finite-dimensional vector space over C and the subset of cusp forms is 
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a subspace. For the level N = 2 the subspace of cusp forms consists 
only of the form 0. 

(2) There is a natural inner product on M(N), so it is possible 
to consider orthogonality in M(N). 

(3) Let N > 1. If M divides N then M2(M) € M2(N). There 
is also the embedding from M(M) into M2(N) given as follows: 
if f € MN) let f(z) = f((N/M)z) for every z € H*; then f € 
M2(N). 

(4) A form f € Ma(N) is called an old form if f is in the subspace 
of M.(N) generated by the images of the mappings considered in 
(3), for all M dividing N. A form f € MA(N) is called a new form 
if it is in the subspace which is orthogonal to the subspace of old 
forms. 


(5) Since 
ir i ) € To(N) 


then f(z+1) = f(z) for each modular form and every z. Thus f 
has a Fourier expansion, which is of the form 


(2.5) f(x = Soe 


For cusp forms, co = 0. 

(6) Hecke defined for each n > 1 coprime to the level N, a linear 
operator T, of M(N). The Hecke operators commute: Tm o T;, = 
T, o Tm for all m,n coprime to the level. A modular form which is 
an eigenvalue for all Hecke operators T, is called an eigenform. 

Other operators associated to the integers n, not coprime with 
N, have also been introduced and, together with the above Hecke 
operators T,,, they generate a larger Hecke algebra, whose properties 
are of essential importance (see Wiles and Taylor, and also Lenstra). 
New forms of level N, which are eigenforms for each T, (with n 
coprime to N) are also eigenforms of the operators of the larger 
Hecke algebra. 

Now we discuss the relationship between elliptic curves and mod- 
ular forms. For a given elliptic curve, the numbers a; (for all primes 
p not dividing the discriminant) contain very important “local” in- 
formation about the curve (for each p). It is crucial to relate these 
local data by means of some “global” invariant. 
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This important idea is a sophisticated transfiguration of the fact 
that every natural number is the product of powers of primes in a 
unique way. Thus Euler already introduced this relation between an 
infinite product extended over all primes and infinite Dirichlet series, 
summed over all integers: 


EH 


p p 


First s was restricted to be a real number s > 1, for which both sides 
converge and are equal. Riemann had the idea, courageous and deep, 
of allowing s to be any complex number with Re(s) > 1. The above 
series is the Riemann zeta function. To prove the existence of in- 
finitely many primes in arithmetic progressions, Dirichlet considered 
"twisted" L-series, where the numerators are no longer 1, but val- 
ues of characters of appropriate finite Abelian groups; each series 
has also an abscissa of convergence and admits an Euler product, 
reflecting also the multiplicative property of characters. 

As for the Riemann zeta function, L-series of characters have only 
poles but no essential singularities at the boundary of the domain 
of convergence. Riemann proved that the functions definded above 
could be extended to the whole plane by analytic continuation, and 
even more remarkably, the values to the right and left of the bound- 
ary line are linked by a functional equation involving the gamma 
function. À great discovery and the royal road for analytical meth- 
ods to enter into number theory. 

In great analogy with number fields, elliptic curves also display 
very important analytical properties of the same kind. The local 
numbers a,, defined above (not forgetting finitely many factors at- 
tached to the primes dividing the discriminant), combine together 
multiplicatively to define numbers a,, (for every n > 1), thus leading 
to a Dirichlet series, called the L-series of the elliptic curve; they 
converge for Re(s) > 3. In computed examples it was observed that 
these L-series admit analytic continuations and functional equations. 
Hasse conjectured that this should be true for every elliptic curve. 
Deuring proved it for the elliptic curves admitting more “symme- 
tries" namely those with complex multiplication. 

For a certain time it had been observed by numerical calculations 
that for many specific elliptic curves the numbers a, coincide with the 
coefficients c, of the Fourier series of some modular form. Elliptic 
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curves with the above property have been called modular elliptic 
curves, or also Weil elliptic curves. 

In 1955, during the Tokyo—Nikko conference on number theory, 
Taniyama proposed problems, two of which concerned — if still 
somewhat imprecisely — the above question. If Hasse’s conjecture 
were true, would the L-series be associated to some automorphic 
function, or even to a modular form? These problems were dis- 
cussed with Shimura and Weil. By 1964, Shimura made known in 
his lectures a very specific conjecture (which however did not appear 
in print on that occasion). Weil contributed in an important way to 
the investigation of the modularity of elliptic curves. His paper (of 
1967) acknowledges previous communications by Shimura but does 
not contain a statement of the conjecture which he considered, even 
later, to be problematic. According to a well-documented study by 
Lang (1995), we shall adopt the name “Shimura—Taniyama conjec- 
ture” for this penetrating statement. 


(7) The Shimura—Taniyama Conjecture: Every elliptic curve 
is modular. 


This is a short way of expressing the following: 


If E is any elliptic curve defined over Q, if N is its conductor, then 
there is a new cusp eigenform f of level N, whose Fourier coefficients 
Cn are integers and such that for every prime p not dividing N, c, = 
a, (where a, is defined by counting the number of points of E in Fp). 


This conjecture says that the rule of determination of the integers 
a» is given by some modular form. 

(8) Shimura proved the converse of the Shimura-Taniyama con- 
jecture. Let f € M(N) be such that its Fourier coefficients are in 
Z. We explain how it is possible to associate an elliptic curve. Let 
zo H. For each y € TN) consider the integral 


KE 
wal) = fads 


it is independent of the path. The set {w.,(7) | y € Po(N)] is 


independent of zo, so it depends only on f. Using the fact that 
the Fourier coefficients of f are integers, the above set is a lattice 
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in H, that is, it is the set of all linear combinations, with integral 
coefficients, of two numbers in H* (the periods of f). This lattice 
gives rise in the usual manner to an analytic torus, hence to an elliptic 
curve E having an equation with coefficients in Z (thus E is defined 
over Q). Let C(N) = (f € MAN) | f is a cusp eigenform whose 
Fourier coefficients are integers]. The above construction associates 
to each f € GON) an elliptic curve E defined over Q. Moreover, the 
conductor of E is the level N of f and for each prime p not dividing 
the discriminant of E, the Fourier coefficient c, of f is equal to the 
number a, (associated to E and p as was already indicated). 

Analytical methods involving the L-series of elliptic curves, their 
Euler product, analytic continuation, and functional equation play 
a fundamental role. 


C. The Work of Ribet and Wiles. 


The work of Ribet involved an argument of descent concerning Ga- 
lois representations and modular forms. We need to explain how 
Serre attached Galois representations to any elliptic curve E defined 
over Q, that is, having an equation with integral coefficients. The 
set of points with complex coordinates (to which is added the point 
at infinity) constitutes an Abelian additive group, well defined by 
the stipulation that the point at infinity should be the zero for the 
addition. The addition is defined by the following rule: if P,Q, R 
are points on the curve, then P+ Q + R = 0 when P,Q,R lie on 
one line (clarifications are needed when P — Q or in some other 
special cases). If K is a subfield of C, let E(K) be the set of pairs 
of elements of K which satisfy the equation of E; then E(K) is a 
subgroup of E(C). 

For each prime p, it is equally possible to define the additive group 
E(F,). In the Abelian group E(C) we consider the set E(C)|p] of all 
elements of order dividing p. These are the point 0 and the points 
P such that P+ P 4---- 4 P (p times) is equal to 0. Then E(C)[p] 
is a subgroup of order p°, which is isomorphic to Z/p x Z/p. The 
coordinates of the points in E(C)[p] are in some Galois extension K 
of finite degree over Q. The elements of the Galois group of K|Q 
act linearly on E(K) and permute among themselves the elements 
of E(C)|p]; by isomorphism with Z/p x Z/p this gives rise to linear 
transformations of Z/p x Z/p. Thus, we obtain a representation, 


374 XI. Epilogue 


associated to E, of the Galois group of K|Q. It is usual to consider 
the field Q of all algebraic numbers; it has infinite degree and con- 
tains K. The Galois group of K|Q is a quotient of the Galois group 
G of Q|Q. So we obtain a representation pg, from G by a group 
of 2 x 2 matrices with entries in Z/p = F, (attention is also paid 
to the natural Krull topology of G). Similar considerations lead to 
representations pg p» by means of 2 x 2 matrices with entries in Z/p” 
(for all n > 1). All the representations pg p» (for n > 1) fit together 
to produce a representation os. of G by 2 x 2 matrices with entries 
in the field Q, of p-adic numbers. 

It is also possible to attach to any eigenform f with Fourier coeffi- 
cients in Z, a representation Gro, of G by 2 x 2 matrices with entries 
in Q,. If E is the elliptic curve associated to f, as indicated before, 
then the representations pro, and pg,o, are isomorphic. 

We outline the proof of Ribet. Assume that FLT' is false for the 
exponent q, let E be the Frey curve associated to a hypothetical so- 
lution; E is a semistable elliptic curve. Assuming that the conjecture 
of Shimura-Taniyama is valid, there exists a new cusp eigenform f of 
weight 2 and level equal to the conductor N of E. Then pro, = PEQ, 

Ribet proved that if p is an odd prime dividing N, N, = N/p, 
then there is a new cusp eigenform fı of weight 2 and level N;, such 
that pg, = Per, The argument may be repeated, leading to a 
nonzero cusp form of weight 2 and level 2 — which is impossible. 

Wiles proved that the Shimura- Taniyama conjecture is valid for 
semistable elliptic curves, in particular for the Frey curve. The the- 
ory of deformation of representations, created by Mazur, plays a 
great role; so does the result (proved with Taylor, and also later 
by Lenstra) on the structure of the commutative algebra generated 
by the Hecke operators. Cohomological results were developed and 
used in essential ways. The proof is at a maximal level of sophistica- 
tion, so it is impossible to report in an intelligent way in this book. 
Among the expository papers listed in the Bibliography, we may 
recommend the one by Gouvéa which delineates the proof, avoiding 
technical details, making it accessible for the courageous amateur. 
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For my readers who are still courageous, I include a bibliography 
which lists not only research papers, but also expository material 
and should be explored by readers wishing to enter deeper into the 
proof of Wiles. 

For the convenience of the reader, the references are organized as 
follows: 


A. Elliptic Curves, Modular Forms: Basic Texts. 


1962 Gunning, R.C., Lectures on Modular Forms, Princeton Uni- 
versity Press, Princeton, NJ, 1962. 

1971 Shimura, G., Introduction to the Theory of Automorphic 
Functions, Princeton University Press, Princeton, NJ, 1971. 

1972 Ogg, A., Survey of modular functions of one variable, in: 
Modular Functions of One Variable (editor, W. Kuyk), Sprin- 
ger- Verlag, New York, 1972. 

1974 Tate, J., The arithmetic of elliptic curves, Invent. Math., 23 
(1974), 179—206. 

1976 Lang, S., Introduction to Modular Forms, Springer-Verlag, 
New York, 1976. 

1984 Koblitz, N., Introduction to Elliptic Curves and Modular 
Forms, Springer-Verlag, New York, 1984. 

1986 Silverman, J.H., The Arithmetic of Elliptic Curves, Springer- 
Verlag, New York, 1986. 

1986 Cornell, G. and Silverman, J.H. (editors), Arithmetic Geom- 
etry, Springer-Verlag, Berlin, 1986. 

1989 Miyake, T., Modular Forms, Springer- Verlag, New York, 1989. 

1989 Hida, H., Theory of p-adic Hecke algebras and Galois repre- 
sentations, Sigaku Expositions, 2 (1989), 75-102. 

1989 Gouvéa, F.Q., Formas Modulares, uma Introducáo, Instituto 
de Matematica Pura e Aplicada, Rio de Janeiro, 1989. 

1991 Cassels, J.W.S., Lectures on Elliptic Curves, Cambridge Uni- 
versity Press, Cambridge, 1991. 

1992 Tate, J. and Silverman, J.H., Rational Points on Elliptic 
Curves, Springer-Verlag, New York, 1992. 
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B. Expository. 


1988 


1989 


1990 


1993 


1993 


1993 


1993 


1994 


1994 


1994 


1994 


1994 


1995 


1995 


1995 


Cipra, B.A., Fermat’s last corollary?, Focus, March-April 
1988, pp. 2 and 6. 

Shimura, G., Yataku Taniyama and his time. Very personal 
recollections, Bull. London Math. Soc., 21 (1989), 186-196. 
Ribet, K.A., From the Taniyama-Shimura conjecture to Fer- 
mat’s last theorem, Ann. Fac. Sci. Toulouse Math., (5), 11 
(1990), no. 1, 116-139. 

Murty, M. Ram, Fermat’s last theorem, an outline, Gaz. Soc. 
Math. Québec, 16 (1993), No. 1, 4-13. 

Murty, M. Ram, Topics in Number Theory, Mehta Res. Inst. 
Lect. Notes, No. 1, Allahabad, 1993. 

Frey, G., Uber A. Wiles’ Beweis der Fermatschen Vermutung, 
Math. Semesterber., 40 (1993), no. 2, 177-191. 

Ribet, K.A., Modular elliptic curves and Fermat’s last the- 
orem, Videocassette, 100 min., Amer. Math. Soc., Provi- 
dence, RI. 

Gouvéa, F.Q., A marvelous proof, Amer. Math. Monthly, 
101 (1994), 203-222. (Updated Portuguese translation: Ma- 
tem. Univ., no. 19, Dec. 1995, pp. 16-43.) 

Cox, D.A., Introduction to Fermat’s last theorem, Amer. 
Math. Monthly, 101 (1994), 3-14. 

Rubin, K. and Silverberg, A., Wiles’ Cambridge lecture, Bull. 
Amer. Math. Soc., 11 (1994), 15-38. 

Ribet, K.A. and Hayes, B., Fermat’s last theorem and modern 
arithmetic, American Scientist, March-April 1994, pp. 146- 
156. 

Ribet, K.A., Wiles proves Taniyama’s conjecture; Fermat’s 
last theorem follows, Notices Amer. Math. Soc., 40 (1993), 
no. 6, 575-576. 

Ribenboim, P., Fermat’s last theorem before June 23, 1993, 
in: Proc. Fourth Conference Canad. Number Theory Assoc., 
Halifax, July 1994 (editor, K. Dilcher), Amer. Math. Soc., 
Providence, RI, 1995, pp. 279-293. 

Schoof, R., Wiles’ proof of Taniyama-Weil conjecture for 
semi-stable elliptic curves over Q, Gaz. Math., Soc. Math. 
France, No. 66 (1995), 7-24. 

Edixhoven, B., Le rôle de la conjecture de Serre dans la 
démonstration du théorème de Fermat, Gaz. Math., Soc. 


1995 


1995 


1995 


1995 


1995 


1995 


1995 


1996 


1996 


1997 


1997 


1997 


1997 
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Math. France, No. 66 (1995), 25-41. (Erratum and ad- 
dendum: Gaz. Math., Soc. Math. France, No. 67 (1996), 
19.) 

Lang, S., Some history of the Shimura—Taniyama conjecture, 
Notices Amer. Math. Soc., 42 (1995), no. 11, 1301-1307. 
Faltings, G., The proof of Fermat’s last theorem by R. Taylor 
and A. Wiles, Notices Amer. Math. Soc., 42 (1995), no. 7, 
743-746. 

Serre, J.-P., Travaux de Wiles (et Taylor, ...), Partie I, 
Séminaire Bourbaki, Vol. 1994/95. Astérisque, No. 237 
(1996), Exp. No. 803, 5, 319-332. 

Oesterlé, J., Travaux de Wiles (et Taylor, ...), Partie II, 
Séminaire Bourbaki, Vol. 1994/95. Astérisque, No. 237 
(1996), Exp. No. 804, 5, 333-355. 

Darmon, H., Diamond, F., and Taylor, R., Fermat’s last the- 
orem. In: Current Developments in Mathematics, 1995 (ed- 
itors, R. Bott, A. Jaffe, and S.T. Yau), pp. 1-154, Internat. 
Press, Cambridge, MA, 1995. Also in: Elliptic curves, mod- 
ular forms D Fermat’s last theorem (Hong Kong, 1993), pp. 
2-140, Internat. Press, Cambridge, MA, 1997. 

Gouvéa, F.Q., Deforming Galois representations: a survey. 
In: Seminar on Fermat’s Last Theorem (Toronto, ON, 1993- 
1994), pp. 179-207, CMS Conf. Proc., No. 17, Amer. Math. 
Soc., Providence, RI, 1995. 

Mazur, B., Fermat’s last theorem, Videocassette, 60 min., 
American Math. Soc., Providence, RI. 

Darmon, H. and Levesque, C., Sommes infinies, équations 
diophantiennes et le dernier théorème de Fermat, Gaz. Soc. 
Math. Québec, 18 (1996), 3-18. 

van der Poorten, A., Notes on Fermat’s Last Theorem, Wiley, 
New York, 1996. 

Cornell, G., Silverman, J.H., and Stevens, G. (editors), Mod- 
ular Forms and Fermat’s Last Theorem, Springer-Verlag, New 
York, 1997. 

Singh, S., Fermat’s Enigma, Viking, London, 1997. 

Singh, S. and Ribet, K.A., Fermat’s last theorem, Scientific 
American, 277 (1997), no. 5, 36-41. 

Kani, E., Fermat’s last theorem, Queen's Math. Commu- 
nicator (Queen's University at Kingston, Ontario, Canada), 
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Summer 1997, pp. 1-8. 
1997 Frey, G., The way to the proof of Fermat’s last theorem, 
preprint, 20 pp., 1997. 


C. Research. 


1958 Shimura, G., Correspondances modulaires et les fonctions 
zeta de courbes algébriques, J. Math. Soc. Japan, 10 (1958), 
1-28. 

1961 Shimura, G., On the zeta-functions of the algebraic curves 
uniformized by certain automorphic functions, J. Math. Soc. 
Japan, 13 (1961), 275-331. 

1967 Shimura, G., Construction of class fields and zeta functions 
of algebraic curves, Ann. of Math., (2), 85 (1967), 58-159. 

1967 Weil, A., Uber die Bestimmung Dirichletscher Reihen durch 
Funktionalgleichungen, Math. Ann., 168 (1967), 149-156. 

1975 Hellegouarch, Y., Points d’ordre 2p sur les courbes ellip- 
tiques, Acta Arith., 26 (1975), 253-263. 

1977 Mazur, B., Modular curves and the Eisenstein ideal, Inst. 
Hautes Etudes Sci. Publ. Math., 47 (1977), 33-186 (1978). 

1982 Frey, G., Rationale Punkte auf Fermatkurven und getwisteten 
Modulkurven, J. Reine Angew. Math., 331 (1982), 185-191. 

1986 Frey, G., Elliptic curves and solutions of A — B = C, in: 
Sém. Th. Nombres, Paris, 1985-1986 (editor, C. Goldstein), 
Progress in Mathematics, Birkhauser, Boston, 1986, pp. 39- 
51. 

1986 Frey, G., Links between elliptic curves and certain diophan- 
tine equations, Ann. Univ. Sarav. Ser. Math., 1 (1986), No. 
1, 1-40. 

1987 Frey, G., Links between elliptic curves and solutions of A — 
B = C, J. Indian Math. Soc., 51 (1987), 117-145. 

1987 Frey, G., Links between solutions of A — B = C and ellip- 
tic curves, in: Number Theory (Ulm, 1987) (editors, H.-P. 
Schlickewei and E. Wirsing), Springer Lect. Notes in Math., 
No. 1380, Springer-Verlag, New York, 1989. 

1987 Serre, J.-P., Sur les représentations modulaires de degré 2 de 
Gal(Q|Q), Duke Math. J., 54 (1987), 179-230. 

1987-1990 Ribet, K.A., On modular representations of Gal(Q|Q), 
preprint, 1987. Invent. Math., 100 (1990), 115-139. 

1989 Mazur, B., Deforming Galois representations, in: Galois 
Groups over Q (editors, Y. Ihara, K.A. Ribet, and J.-P. 
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Serre), Math. Sci. Res. Inst. Publ., Vol. 16, Springer- 
Verlag, New York, 1989. 

1990 Ribet, K.A., From the Taniyama-Shimura conjecture to Fer- 
mat’s last theorem, Ann. Sci. Univ. Toulouse, (5), 11 (1990), 
115-139. 

1991 Ribet, K.A., Lowering the levels of modular representations 
without multiplicity one, Internat. Math. Res. Notices 1991, 
no. 2, 15-19. 

1991 Kolyvagin, V., Euler systems, in: The Grothendieck Fest- 
schrift, Vol. 2, pp. 435-483, Birkhàuser, Boston, 1991. 

1992 Flach, M., A finiteness theorem for the symmetric square of 
an elliptic curve, Invent. Math., 109 (1992), 307-327. 

1993 Lenstra, H.W. Jr., Complete intersections and Gorenstein 
rings, preprint (September 27, 1993). 

1993 Ramakrishna, R., On a variation of Mazur’s deformation 
functor, Compositio Math., 87 (1993), 269-286. 

1995 Ribet, K.A., Galois representations and modular forms, Bull. 
Amer. Math. Soc., 32 (1995), 375-401. 

1995 Wiles, A., Modular elliptic curves and Fermat’s last theorem, 
Ann. of Math., (2), 141 (1995), 443-551. 

1995 Taylor, R. and Wiles, A., Ring theoretic properties of certain 
Hecke algebras, Ann. of Math., (2), 141 (1995), 553-572. 
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As aconcluding note, here are timely communications by Karl Rubin 
which circulated widely. 


E-mail message no. 1: 


Date: June 23, 1993, 05:52:30 

Subject: big news 

Andrew Wiles just announced, at the end of his 3rd lecture here, 
that he has proved Fermat’s Last Theorem. He did this by proving 
that every semistable elliptic curve over Q (i.e. square-free conduc- 
tor) is modular. The curves that Frey writes down, arising from 
counterexamples to Fermat, are semistable and by work of Ribet they 
cannot be modular, so this does it. 

It’s an amazing piece of work. 

Karl 
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E-mail message no. 2: 


Date: Oct. 25, 1994, 10:24:46 
Subject: update on Fermat’s last theorem 
As of this morning, two manuscripts have been released: 


Modular elliptic curves and Fermat’s last theorem, 
by Andrew Wiles 

Ring theoretic properties of certain Hecke algebras, by 
Richard Taylor and Andrew Wiles. 


The first one (long) announces a proof of, among other things, 
Fermat’s last theorem, relying on the second one (short) for one 
crucial step. 

As most of you know, the argument described by Wiles in his Cam- 
bridge lectures turned out to have a serious gap, namely the construc- 
tion of an Euler system. After trying unsuccessfully to repair that 
construction, Wiles went back to a different approach, which he had 
tried earlier but abandoned in favor of the Euler systems idea. He was 
able to complete his proof, under the hypothesis that certain Hecke 
algebras are local complete intersections. This and the rest of the 
ideas described in Wiles’ Cambridge lectures are written up in the 
first manuscript. Jointly, Taylor and Wiles establish the necessary 
property of the Hecke algebras in the second paper. 

The overall outline of the argument is similar to the one Wiles de- 
scribed in Cambridge. The new approach turns out to be significantly 
simpler and shorter than the original one, because of the removal of 
the Euler system. (In fact, after seeing these manuscripts, Faltings 
has apparently come up with a further significant simplification of 
that part of the argument.) 

Versions of these manuscripts have been in the hands of a small 
number of people for (in some cases) a few weeks. While it is wise 
to be cautious for a little while longer, there is certainly reason for 
optimism. 

Karl Rubin 


Excitement, caution, and amazement in the face of a superlative 
feat in Mathematics. 


Appendix A 
References to Wrong Proofs 


It is well known that there have been literally thousands of wrong 
proofs of Fermat’s last theorem. This can be explained by the fact 
that the statement of the problem is easily understandable to an 
amateur. Moreover, there have been important prizes offered by 
academies and foundations which have stimulated efforts by dilet- 
tantes as well as professional mathematicians. 

Since the Wolfskehl Prize was established in 1908, in the first 
years alone, 621 wrong solutions were submitted, and today there 
are about 3 meters of file correspondence and proposed solutions of 
Fermat’s problem stored in Góttingen. 

We indicate below a list — obviously incomplete — of some noto- 
riously wrong published attempts to solve the problem. Even good 
professional mathematicians have not escaped from being included 
in the roll. 

Remarkable is the case of F. Lindemann, who discovered the tran- 
scendency of the number m. Yet, with respect to Fermat’s last the- 
orem, all his attempts failed. 

F. Paulet seems to have been one of the most persistent, with 
twelve submissions to the Academy of Sciences of Paris, spanning 
the years 1841 to 1862, but not bringing any progress to the investi- 
gation. 
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With only a few exceptions, we do not mention wrong solutions 
published by the authors as independent books or brochures; some 
of these have been listed by Fleck and Maennchen (1908-1912), Mi- 
rimanoff (1909), and Perron (1916). Instead, we concentrate only on 
the intended, but failed solutions, published in mathematical jour- 
nals or proceedings of conferences. 

First, we give a selection of books or papers containing references 
to wrong proofs. This is followed by a list of wrong papers, including 
an indication of where the mistake is discussed. 


I. Papers or Books Containing Lists of Wrong Proofs 


1908 Hoffmann, F., Der Satz vom Fermat. Sein seit dem Jahr 
1658 gesuchter Beweis, J. Singer, Strasbourg, 1908. 

1909 Lampe, E., Jahrbuch Fortschritte Math., 40 (1909), 258- 
261. 

1909/10/11/12/16 Fleck, A. and Maennchen, A., Vermeintliche 
SE des Fermatschen Satzes, Arch. Math. Phys., (3), 
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